Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 65DCD176EE for ; Mon, 18 May 2015 05:40:05 +0000 (UTC) Received: (qmail 65854 invoked by uid 500); 18 May 2015 05:40:01 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 65789 invoked by uid 500); 18 May 2015 05:40:01 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 65777 invoked by uid 99); 18 May 2015 05:40:00 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 18 May 2015 05:40:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 84A20C5922 for ; Mon, 18 May 2015 05:40:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.1 X-Spam-Level: X-Spam-Status: No, score=-0.1 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id DxC0T9miStgG for ; Mon, 18 May 2015 05:39:49 +0000 (UTC) Received: from mail-wg0-f50.google.com (mail-wg0-f50.google.com [74.125.82.50]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id B6CFC428F6 for ; Mon, 18 May 2015 05:39:48 +0000 (UTC) Received: by wgjc11 with SMTP id c11so12769101wgj.0 for ; Sun, 17 May 2015 22:39:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:content-type:content-transfer-encoding:mime-version:subject :message-id:date:references:in-reply-to:to; bh=yzstzoIYYc9r1AkSfByVjRCvRuCLZcOUDva9Dq8jYQI=; b=tZzZXqE6nYSkPUssfyjtlF/ZfOPWjMfPpgcHOhrQN1vFPgAE0c5wwTZOH/IDCG2SkC vP/9UHlVf9sQDIwJSQWCjPSrkW0I0a3/F96D9HzHuYjJsbF1wJzpZtUw/WIEUmCIugS3 h7RRUkrUa6hOsBEmoaTCY9EyAel9D4bdVDH8VGIqnQUnfGeAXg0a6THF6S2zbJ75Dnb5 tRDoq0lkwFBOAMlkKr4I83U8ihNlS/WTzjE5hGJIZX+YFW4oanRx/2zcGHzixLnckZ7L FERs4/HJMOItDiDtvKzCAGKxtpZ8mfC4yW9dRzoN1BiYPH4FDCgfqO/fJXUjdxAWPDaI xDVA== X-Received: by 10.194.78.49 with SMTP id y17mr41335856wjw.131.1431927587859; Sun, 17 May 2015 22:39:47 -0700 (PDT) Received: from [192.168.178.33] (dslb-178-005-179-169.178.005.pools.vodafone-ip.de. [178.5.179.169]) by mx.google.com with ESMTPSA id it5sm10633624wid.3.2015.05.17.22.39.47 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sun, 17 May 2015 22:39:47 -0700 (PDT) From: Lars George Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (1.0) Subject: Re: HBase failing to restart in single-user mode Message-Id: Date: Mon, 18 May 2015 07:39:46 +0200 References: In-Reply-To: To: "user@hbase.apache.org" X-Mailer: iPhone Mail (12F70) I noticed similar ZK related issues but those went away after changing the Z= K directory to a permanent directory along with the HBase root directory. Bo= th point now to a location in my home folder and restarts work fine now. Not= much help but wanted to at least state that. =20 Lars=20 Sent from my iPhone > On 18 May 2015, at 05:55, tsuna wrote: >=20 > Hi all, > For testing on my laptop (OSX with JDK 1.7.0_45) I usually build the > latest version from branch-1.0 and use the following config: >=20 > > > hbase.rootdir > file:///tmp/hbase-${user.name} > > > hbase.online.schema.update.enable > true > > > zookeeper.session.timeout > 300000 > > > hbase.zookeeper.property.tickTime > 2000000 > > > hbase.zookeeper.dns.interface > lo0 > > > hbase.regionserver.dns.interface > lo0 > > > hbase.master.dns.interface > lo0 > > >=20 > Since at least a month ago (perhaps longer, I don=E2=80=99t remember exact= ly) > I can=E2=80=99t restart HBase. The very first time it starts up fine, but= > subsequent startup attempts all fail with: >=20 > 2015-05-17 20:39:19,024 INFO [RpcServer.responder] ipc.RpcServer: > RpcServer.responder: starting > 2015-05-17 20:39:19,024 INFO [RpcServer.listener,port=3D49809] > ipc.RpcServer: RpcServer.listener,port=3D49809: starting > 2015-05-17 20:39:19,029 INFO [main] http.HttpRequestLog: Http request > log for http.requests.regionserver is not defined > 2015-05-17 20:39:19,030 INFO [main] http.HttpServer: Added global > filter 'safety' > (class=3Dorg.apache.hadoop.hbase.http.HttpServer$QuotingInputFilter) > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter > static_user_filter > (class=3Dorg.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFi= lter) > to context regionserver > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter > static_user_filter > (class=3Dorg.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFi= lter) > to context static > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter > static_user_filter > (class=3Dorg.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFi= lter) > to context logs > 2015-05-17 20:39:19,033 INFO [main] http.HttpServer: Jetty bound to port 4= 9811 > 2015-05-17 20:39:19,033 INFO [main] mortbay.log: jetty-6.1.26 > 2015-05-17 20:39:19,157 INFO [main] mortbay.log: Started > SelectChannelConnector@0.0.0.0:49811 > 2015-05-17 20:39:19,222 INFO [M:0;localhost:49807] > zookeeper.RecoverableZooKeeper: Process > identifier=3Dhconnection-0x4f708099 connecting to ZooKeeper > ensemble=3Dlocalhost:2181 > 2015-05-17 20:39:19,222 INFO [M:0;localhost:49807] > zookeeper.ZooKeeper: Initiating client connection, > connectString=3Dlocalhost:2181 sessionTimeout=3D10000 > watcher=3Dhconnection-0x4f7080990x0, quorum=3Dlocalhost:2181, > baseZNode=3D/hbase > 2015-05-17 20:39:19,223 INFO > [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: > Opening socket connection to server localhost/127.0.0.1:2181. Will not > attempt to authenticate using SASL (unknown error) > 2015-05-17 20:39:19,223 INFO > [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: > Socket connection established to localhost/127.0.0.1:2181, initiating > session > 2015-05-17 20:39:19,223 INFO > [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181] > server.NIOServerCnxnFactory: Accepted socket connection from > /127.0.0.1:49812 > 2015-05-17 20:39:19,223 INFO > [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181] server.ZooKeeperServer: > Client attempting to establish new session at /127.0.0.1:49812 > 2015-05-17 20:39:19,224 INFO [SyncThread:0] server.ZooKeeperServer: > Established session 0x14d651aaec00002 with negotiated timeout 4000000 > for client /127.0.0.1:49812 > 2015-05-17 20:39:19,224 INFO > [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: > Session establishment complete on server localhost/127.0.0.1:2181, > sessionid =3D 0x14d651aaec00002, negotiated timeout =3D 4000000 > 2015-05-17 20:39:19,249 INFO [M:0;localhost:49807] > regionserver.HRegionServer: ClusterId : > 6ad7eddd-2886-4ff0-b377-a2ff42c8632f > 2015-05-17 20:39:49,208 ERROR [main] master.HMasterCommandLine: Master exi= ting > java.lang.RuntimeException: Master not active after 30 seconds > at org.apache.hadoop.hbase.util.JVMClusterUtil.startup(JVMClusterUt= il.java:194) > at org.apache.hadoop.hbase.LocalHBaseCluster.startup(LocalHBaseClus= ter.java:445) > at org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster(HM= asterCommandLine.java:197) > at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCom= mandLine.java:139) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) > at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerComm= andLine.java:126) > at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:2002) >=20 >=20 > I noticed that this has something to do with the ZooKeeper data. If I > rm -rf $TMPDIR/hbase-tsuna/zookeeper then I can start HBase again. > But of course HBase won=E2=80=99t work properly because while some tables > exist on the filesystem, they no longer exist in ZK, etc. >=20 > Does anybody know what could be left behind in ZK that could make it > hang during startup? I looked at a jstack output while it was paused > during 30s and didn=E2=80=99t find anything noteworthy. >=20 > --=20 > Benoit "tsuna" Sigoure