Subject: Re: Master (should not?) abort startup on Unexpected PENDING_OPEN state
From: ramkrishna vasudevan
To: "user@hbase.apache.org", lars hofhansl
Cc: Jean-Marc Spaggiari
Date: Fri, 27 Dec 2013 09:16:20 +0530

Is this in the 0.94 version or in the 0.96 version?

On Fri, Dec 27, 2013 at 3:07 AM, lars hofhansl wrote:

> Yeah, sounds like a bug. Mind filing a jira?
>
> ________________________________
> From: Jean-Marc Spaggiari
> To: user ; lars hofhansl
> Sent: Thursday, December 26, 2013 12:05 PM
> Subject: Re: Master (should not?) abort startup on Unexpected PENDING_OPEN state
>
> It was aborting each time I was trying. I tried at least 10 times. Failed
> 10 times. I have deleted the znodes and restarted and it started correctly.
>
> I might be able to reproduce the situation.
>
> 2013/12/26 lars hofhansl
>
> > When you start the master again, does it abort again?
> >
> >________________________________
> > From: Jean-Marc Spaggiari
> >To: user
> >Sent: Thursday, December 26, 2013 6:00 AM
> >Subject: Master (should not?) abort startup on Unexpected PENDING_OPEN state
> >
> >I think I stopped my master while it was doing a big balancing. At restart,
> >I'm getting the exception below and the master exits. All RS are able to start
> >correctly, but not the master.
> >
> >Since master is not starting I cannot manually assign this region from the
> >shell. I guess I can simply delete the znode about the region, restart and
> >hbck, but my opinion is that we should not abort the startup when such an
> >exception occurs.
> >
> >JM
> >
> >java.lang.IllegalStateException: Unexpected state :
> >page,www\x1Fhttp\x1F-1\x1F/vote/comment/27996/1/\x1Fnull,1379104524006.17bee313797fc1ce982c0e31fdb6620c.
> >state=PENDING_OPEN, ts=1388065670415, server=node6,60020,1388027343261 ..
> >Cannot transit it to OFFLINE.
> >    at org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:1890)
> >    at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1690)
> >    at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1426)
> >    at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1398)
> >    at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1393)
> >    at org.apache.hadoop.hbase.master.handler.ClosedRegionHandler.process(ClosedRegionHandler.java:105)
> >    at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:175)
> >    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> >    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> >    at java.lang.Thread.run(Thread.java:744)