Message-ID: <5171F39E.6000603@gmail.com>
Date: Fri, 19 Apr 2013 21:47:10 -0400
From: Josh Elser
To: dev@accumulo.apache.org
Subject: Re: org.apache.accumulo.test.TestAccumuloSplitRecovery 1.5 hangs indefinitely
References: <5171EEF2.4020004@gmail.com>
In-Reply-To: <5171EEF2.4020004@gmail.com>

Thought about it more, and remembered about the JUnit temp dir. Found
that the two TServers both lost their ZK lock. Perhaps the configuration
is just a little too constrained?

On 04/19/2013 09:27 PM, Josh Elser wrote:
> Is anyone else seeing this? I didn't have this happening early this
> week (Wednesday, maybe?).
>
> It gets stuck trying to get the Connector:
>
> at org.apache.accumulo.core.util.UtilWaitThread.sleep(UtilWaitThread.java:26)
> at org.apache.accumulo.core.client.impl.ServerClient.executeRaw(ServerClient.java:112)
> at org.apache.accumulo.core.client.impl.ServerClient.execute(ServerClient.java:71)
> at org.apache.accumulo.core.client.impl.ConnectorImpl.<init>(ConnectorImpl.java:64)
> at org.apache.accumulo.core.client.ZooKeeperInstance.getConnector(ZooKeeperInstance.java:227)
> at org.apache.accumulo.core.client.ZooKeeperInstance.getConnector(ZooKeeperInstance.java:222)
> at org.apache.accumulo.test.TestAccumuloSplitRecovery.test(TestAccumuloSplitRecovery.java:87)
>
> The ZKMain was running, as was the Master; I'm not quite sure how to
> debug it. I grabbed stacktraces from each process when it happened,
> and it appears that I can reliably reproduce it (about 3/3 so far).
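
For anyone trying to reproduce this without the test blocking forever, here is a rough sketch of bounding a blocking call with a timeout and dumping every thread's stack when it expires. It uses only JDK classes; BoundedCall, callWithTimeout and dumpStacks are made-up names for illustration and are not part of Accumulo or of the test. The lambdas in main are stand-ins for the getConnector call in the trace above, not the real thing.

```java
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class BoundedCall {

    /**
     * Runs the task on a worker thread and waits at most timeoutMillis.
     * On timeout, prints all thread stacks, interrupts the worker and
     * rethrows the TimeoutException.
     */
    public static <T> T callWithTimeout(Callable<T> task, long timeoutMillis) throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        Future<T> future = pool.submit(task);
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            dumpStacks();          // capture where every thread is stuck
            future.cancel(true);   // interrupt the worker thread
            throw e;
        } finally {
            pool.shutdownNow();    // do not leave the worker thread behind
        }
    }

    /** Prints the name and current stack of every live thread in this JVM. */
    public static void dumpStacks() {
        for (Map.Entry<Thread, StackTraceElement[]> entry : Thread.getAllStackTraces().entrySet()) {
            System.err.println("Thread: " + entry.getKey().getName());
            for (StackTraceElement frame : entry.getValue()) {
                System.err.println("    at " + frame);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical stand-in for a call that returns promptly.
        System.out.println(callWithTimeout(() -> "connected", 1000));

        // Hypothetical stand-in for a call that never returns in time.
        try {
            callWithTimeout(() -> { Thread.sleep(60_000); return "connected"; }, 200);
        } catch (TimeoutException e) {
            System.out.println("timed out");
        }
    }
}
```

This only covers the client JVM running the test; the TServer, Master and ZooKeeper processes would still need their stacks grabbed separately.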