Date: Wed, 26 Jul 2017 16:24:00 +0000 (UTC)
From: "Matthew Jacobs (JIRA)"
To: issues@impala.incubator.apache.org
Reply-To: dev@impala.incubator.apache.org
Subject: [jira] [Created] (IMPALA-5724) TestStatestore timed out in exhaustive jenkins run

Matthew Jacobs created IMPALA-5724:
--------------------------------------

             Summary: TestStatestore timed out in exhaustive jenkins run
                 Key: IMPALA-5724
                 URL: https://issues.apache.org/jira/browse/IMPALA-5724
             Project: IMPALA
          Issue Type: Bug
          Components: Distributed Exec
    Affects Versions: Impala 2.10.0
         Environment: rhel7
            Reporter: Matthew Jacobs
            Priority: Critical
         Attachments: statestored.log

On a recent exhaustive jenkins run (on rhel7), TestStatestore timed out:
{code}
08:41:31 [gw2] PASSED unittests/test_file_parser.py::TestTestFileParser::test_parse_commented_out_test_as_comment
08:41:31 unittests/test_result_verifier.py::TestResultVerifier::test_result_row_indexing[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: text/none]
08:41:31 [gw2] PASSED unittests/test_result_verifier.py::TestResultVerifier::test_result_row_indexing[exec_option: {'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: text/none]
08:41:31 [gw3] PASSED statestore/test_statestore.py::TestStatestore::test_update_is_delta
08:41:45 [gw0] PASSED statestore/test_statestore.py::TestStatestore::test_failure_detected
Build timed out (after 1,440 minutes). Marking the build as failed.
20:56:59 Build was aborted
20:56:59 Archiving artifacts
20:56:59
20:56:59 [gw0] node down: Not properly terminated
20:56:59 [gw0] FAILED statestore/test_statestore.py::TestStatestore::test_topic_persistence
20:56:59 Replacing crashed slave gw0
20:57:05 Recording test results
20:57:08 Email was triggered for: Failure
20:57:08 Sending email for trigger: Failure
20:57:08 Sending email to: impala-jenkins@cloudera.com
20:57:08
20:57:08 Deleting project workspace...
20:57:08 done
20:57:08
20:57:08 Finished: FAILURE
{code}

The statestore logs show a lot of errors like
{code}
I0726 08:24:01.102785 30978 statestore.cc:526] Preparing initial test_skipped_b1501e92-7215-11e7-a5fa-02581563417c topic update for python-test-client-b1507018-7215-11e7-a5fa-02581563417c. Size = 8.00 B
I0726 08:24:01.103085 30978 thrift-util.cc:123] TSocket::open() connect() Connection refused
I0726 08:24:01.415092 30978 status.cc:55] RPC Error: Client for localhost:45518 hits an unexpected exception: TProtocolException: Invalid data, type: N6apache6thrift8protocol18TProtocolExceptionE rpc send completed: true
    @ 0x12590d6 impala::Status::Status()
    @ 0x15ee502 impala::ClientConnection<>::DoRpc<>()
    @ 0x15e7431 impala::Statestore::SendTopicUpdate()
    @ 0x15e9610 impala::Statestore::DoSubscriberUpdate()
    @ 0x15fecfe boost::_mfi::mf3<>::operator()()
    @ 0x15fd5a5 boost::_bi::list4<>::operator()<>()
    @ 0x15fb3de boost::_bi::bind_t<>::operator()<>()
    @ 0x15f88d3 boost::detail::function::void_function_obj_invoker2<>::invoke()
    @ 0x15f4dfd boost::function2<>::operator()()
    @ 0x15f06ef impala::ThreadPool<>::WorkerThread()
    @ 0x160038d boost::_mfi::mf1<>::operator()()
    @ 0x15ffe17 boost::_bi::list2<>::operator()<>()
    @ 0x15fedfd boost::_bi::bind_t<>::operator()()
    @ 0x15fd88c boost::detail::function::void_function_obj_invoker0<>::invoke()
    @ 0x13d6148 boost::function0<>::operator()()
    @ 0x16a7031 impala::Thread::SuperviseThread()
    @ 0x16afb38 boost::_bi::list4<>::operator()<>()
    @ 0x16afa7b boost::_bi::bind_t<>::operator()()
    @ 0x16afa3e boost::detail::thread_data<>::run()
    @ 0x1ba055a thread_proxy
    @ 0x7f23cdfa9df3 start_thread
    @ 0x7f23cdcd71ad __clone
I0726 08:24:01.415179 30978 client-cache.cc:170] Broken Connection, destroy client for localhost:45518
I0726 08:24:01.415273 30978 statestore.cc:697] Unable to send topic update message to subscriber python-test-client-b1507018-7215-11e7-a5fa-02581563417c, received error: RPC Error: Client for localhost:45518 hits an unexpected exception: TProtocolException: Invalid data, type: N6apache6thrift8protocol18TProtocolExceptionE rpc send completed: true
{code}

I've attached the full statestored log.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)