Date: Mon, 18 May 2015 22:07:14 -0700
Subject: Re: HBase failing to restart in single-user mode
From: Nick Dimiduk
To: "user@hbase.apache.org"

Wait. Benoit, you mean restart the laptop or stop/start HBase? I agree that
contents of /tmp are not stable across system reboot, across stop/start of
HBase process there should be no problems. Should.

For what it's worth, on the Mac and local mode testing, I usually use
$HBASE_HOME/data. This is usually not on /tmp.

On Monday, May 18, 2015, anil gupta wrote:

> Hi Benoit,
> I think you need to move the directory out of "/tmp" and give it a shot.
> /tmp/hbase-${user.name}/zk will get cleaned up during restart.
>
>
> ~Anil
>
> On Mon, May 18, 2015 at 9:45 PM, tsuna wrote:
>
> > I added this to hbase-site.xml:
> >
> >   <property>
> >     <name>hbase.zookeeper.property.dataDir</name>
> >     <value>/tmp/hbase-${user.name}/zk</value>
> >   </property>
> >
> > Didn’t change anything. Once I kill/shutdown HBase, it won’t come back up.
> >
> > On Mon, May 18, 2015 at 1:14 AM, Viral Bajaria wrote:
> > > Same for me, I had faced similar issues especially on my virtual machines
> > > since I would restart them more often than my host machine.
> > >
> > > Moving ZK from /tmp which could get cleared on reboots fixed the issue for
> > > me.
> > >
> > > Thanks,
> > > Viral
> > >
> > >
> > > On Sun, May 17, 2015 at 10:39 PM, Lars George wrote:
> > >
> > >> I noticed similar ZK related issues but those went away after changing the
> > >> ZK directory to a permanent directory along with the HBase root directory.
> > >> Both point now to a location in my home folder and restarts work fine now.
> > >> Not much help but wanted to at least state that.
> > >>
> > >> Lars
> > >>
> > >> Sent from my iPhone
> > >>
> > >> > On 18 May 2015, at 05:55, tsuna wrote:
> > >> >
> > >> > Hi all,
> > >> > For testing on my laptop (OSX with JDK 1.7.0_45) I usually build the
> > >> > latest version from branch-1.0 and use the following config:
> > >> >
> > >> > <configuration>
> > >> > <property>
> > >> >   <name>hbase.rootdir</name>
> > >> >   <value>file:///tmp/hbase-${user.name}</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>hbase.online.schema.update.enable</name>
> > >> >   <value>true</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>zookeeper.session.timeout</name>
> > >> >   <value>300000</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>hbase.zookeeper.property.tickTime</name>
> > >> >   <value>2000000</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>hbase.zookeeper.dns.interface</name>
> > >> >   <value>lo0</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>hbase.regionserver.dns.interface</name>
> > >> >   <value>lo0</value>
> > >> > </property>
> > >> > <property>
> > >> >   <name>hbase.master.dns.interface</name>
> > >> >   <value>lo0</value>
> > >> > </property>
> > >> > </configuration>
> > >> >
> > >> > Since at least a month ago (perhaps longer, I don’t remember exactly)
> > >> > I can’t restart HBase. The very first time it starts up fine, but
> > >> > subsequent startup attempts all fail with:
> > >> >
> > >> > 2015-05-17 20:39:19,024 INFO [RpcServer.responder] ipc.RpcServer: RpcServer.responder: starting
> > >> > 2015-05-17 20:39:19,024 INFO [RpcServer.listener,port=49809] ipc.RpcServer: RpcServer.listener,port=49809: starting
> > >> > 2015-05-17 20:39:19,029 INFO [main] http.HttpRequestLog: Http request log for http.requests.regionserver is not defined
> > >> > 2015-05-17 20:39:19,030 INFO [main] http.HttpServer: Added global filter 'safety' (class=org.apache.hadoop.hbase.http.HttpServer$QuotingInputFilter)
> > >> > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter static_user_filter (class=org.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFilter) to context regionserver
> > >> > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter static_user_filter (class=org.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFilter) to context static
> > >> > 2015-05-17 20:39:19,031 INFO [main] http.HttpServer: Added filter static_user_filter (class=org.apache.hadoop.hbase.http.lib.StaticUserWebFilter$StaticUserFilter) to context logs
> > >> > 2015-05-17 20:39:19,033 INFO [main] http.HttpServer: Jetty bound to port 49811
> > >> > 2015-05-17 20:39:19,033 INFO [main] mortbay.log: jetty-6.1.26
> > >> > 2015-05-17 20:39:19,157 INFO [main] mortbay.log: Started SelectChannelConnector@0.0.0.0:49811
> > >> > 2015-05-17 20:39:19,222 INFO [M:0;localhost:49807] zookeeper.RecoverableZooKeeper: Process identifier=hconnection-0x4f708099 connecting to ZooKeeper ensemble=localhost:2181
> > >> > 2015-05-17 20:39:19,222 INFO [M:0;localhost:49807] zookeeper.ZooKeeper: Initiating client connection, connectString=localhost:2181 sessionTimeout=10000 watcher=hconnection-0x4f7080990x0, quorum=localhost:2181, baseZNode=/hbase
> > >> > 2015-05-17 20:39:19,223 INFO [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: Opening socket connection to server localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (unknown error)
> > >> > 2015-05-17 20:39:19,223 INFO [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: Socket connection established to localhost/127.0.0.1:2181, initiating session
> > >> > 2015-05-17 20:39:19,223 INFO [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181] server.NIOServerCnxnFactory: Accepted socket connection from /127.0.0.1:49812
> > >> > 2015-05-17 20:39:19,223 INFO [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181] server.ZooKeeperServer: Client attempting to establish new session at /127.0.0.1:49812
> > >> > 2015-05-17 20:39:19,224 INFO [SyncThread:0] server.ZooKeeperServer: Established session 0x14d651aaec00002 with negotiated timeout 4000000 for client /127.0.0.1:49812
> > >> > 2015-05-17 20:39:19,224 INFO [M:0;localhost:49807-SendThread(localhost:2181)] zookeeper.ClientCnxn: Session establishment complete on server localhost/127.0.0.1:2181, sessionid = 0x14d651aaec00002, negotiated timeout = 4000000
> > >> > 2015-05-17 20:39:19,249 INFO [M:0;localhost:49807] regionserver.HRegionServer: ClusterId : 6ad7eddd-2886-4ff0-b377-a2ff42c8632f
> > >> > 2015-05-17 20:39:49,208 ERROR [main] master.HMasterCommandLine: Master exiting
> > >> > java.lang.RuntimeException: Master not active after 30 seconds
> > >> >     at org.apache.hadoop.hbase.util.JVMClusterUtil.startup(JVMClusterUtil.java:194)
> > >> >     at org.apache.hadoop.hbase.LocalHBaseCluster.startup(LocalHBaseCluster.java:445)
> > >> >     at org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster(HMasterCommandLine.java:197)
> > >> >     at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:139)
> > >> >     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
> > >> >     at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:126)
> > >> >     at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:2002)
> > >> >
> > >> >
> > >> > I noticed that this has something to do with the ZooKeeper data. If I
> > >> > rm -rf $TMPDIR/hbase-tsuna/zookeeper then I can start HBase again.
> > >> > But of course HBase won’t work properly because while some tables
> > >> > exist on the filesystem, they no longer exist in ZK, etc.
> > >> >
> > >> > Does anybody know what could be left behind in ZK that could make it
> > >> > hang during startup? I looked at a jstack output while it was paused
> > >> > during 30s and didn’t find anything noteworthy.
> > >> >
> > >> > --
> > >> > Benoit "tsuna" Sigoure
> > >>
> >
> >
> >
> > --
> > Benoit "tsuna" Sigoure
> >
>
>
> --
> Thanks & Regards,
> Anil Gupta
>
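
A minimal sketch of the change Lars and Viral describe in the thread: pointing both the HBase root directory and the ZooKeeper data directory at a location that survives reboots. The /Users/tsuna/hbase-data path is a hypothetical placeholder, since nobody in the thread names the directory they actually used. The thread also does not establish that this resolves the failure when HBase is only stopped and started without a reboot.

```xml
<configuration>
  <!-- Hypothetical persistent location outside /tmp; substitute any
       directory that is not cleaned on reboot. -->
  <property>
    <name>hbase.rootdir</name>
    <value>file:///Users/tsuna/hbase-data/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/Users/tsuna/hbase-data/zookeeper</value>
  </property>
</configuration>
```

Existing data under the old /tmp locations is not moved by this change, so the simplest way to try it is probably a fresh start with both new directories empty.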