Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6249E10F82 for ; Thu, 27 Nov 2014 14:54:51 +0000 (UTC) Received: (qmail 87674 invoked by uid 500); 27 Nov 2014 14:54:49 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 87602 invoked by uid 500); 27 Nov 2014 14:54:49 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 87585 invoked by uid 99); 27 Nov 2014 14:54:48 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Nov 2014 14:54:48 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yuzhihong@gmail.com designates 209.85.160.170 as permitted sender) Received: from [209.85.160.170] (HELO mail-yk0-f170.google.com) (209.85.160.170) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Nov 2014 14:54:22 +0000 Received: by mail-yk0-f170.google.com with SMTP id q200so2244766ykb.15 for ; Thu, 27 Nov 2014 06:53:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=0PDCN+1IdiWamQl6aTEVG3r9ykCWQNojNkDDUVGNXdE=; b=t8c/geZhMPEhOvv8KZhI/ghPfDehWAoaF6K371hkkPphfkczYRVauTErdp5XoREGqA NAaxDHKgWgzgJhGQlVTouyLjf6mMt3oeUOugBLeXpnKsgPvBE8bZ3Ia1yx5AOcpMY5w2 5X88+G0oIfFuH4OLFNUBlQRjQRq393T9TbMLUA8M3ilTBR4X22pTVUeiiPncfzez6Uz5 P54qe4jZwxu22oU+mXBiRxf+moiM5FgXvWJT+ydJ/HWqVvrBcsp6R1c0wOJTaU4dsEoB wbEjw0uzQ4onK4NYz4nYxmgEiI58uSWd29SKMazINevwU+0VvWmipPQDZThN9gex5/3z QQ6w== MIME-Version: 1.0 X-Received: by 10.236.220.194 with SMTP id o62mr37563336yhp.32.1417100016746; Thu, 27 Nov 2014 06:53:36 -0800 (PST) Received: by 10.170.180.7 with HTTP; Thu, 27 Nov 2014 06:53:36 -0800 (PST) In-Reply-To: References: Date: Thu, 27 Nov 2014 06:53:36 -0800 Message-ID: Subject: Re: Zookeeper shuting down. From: Ted Yu To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=001a11c22f9c1c098c0508d84fe8 X-Virus-Checked: Checked by ClamAV on apache.org --001a11c22f9c1c098c0508d84fe8 Content-Type: text/plain; charset=UTF-8 bq. Cannot open channel to 1 at election address /172.10.195.299:3888 Can you check zookeeper log on 172.10.195.299 ? Cheers On Thu, Nov 27, 2014 at 12:29 AM, wrote: > Hi, > > Please find the log from the master node, I am using hbase-0.94.12 and > zookeeper-3.4.5 > > 014-11-27 12:32:21,444 [myid:0] - INFO > [Thread-1:QuorumCnxManager$Listener@486] - My election bind port: > 0.0.0.0/0.0.0.0:3888 > 2014-11-27 12:32:21,459 [myid:0] - INFO > [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:2181:QuorumPeer@670] - LOOKING > 2014-11-27 12:32:21,461 [myid:0] - INFO > [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:2181:FastLeaderElection@740] - New > election. My id = 0, proposed zxid=0x40 > 2014-11-27 12:32:21,464 [myid:0] - INFO > [WorkerReceiver[myid=0]:FastLeaderElection@542] - Notification: 0 > (n.leader), 0x40 (n.zxid), 0x1 (n.round), LOOKING (n.state), 0 (n.sid), > 0x1 (n.peerEPoch), LOOKING (my state) > 2014-11-27 12:32:21,468 [myid:0] - WARN > [WorkerSender[myid=0]:QuorumCnxManager@368] - Cannot open channel to 1 at > election address /172.10.195.299:3888 > java.net.ConnectException: Connection refused > at java.net.PlainSocketImpl.socketConnect(Native Method) > at > > java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339) > at > > java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200) > at > java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182) > at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) > at java.net.Socket.connect(Socket.java:579) > at > > org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:354) > at > > org.apache.zookeeper.server.quorum.QuorumCnxManager.toSend(QuorumCnxManager.java:327) > at > > org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.process(FastLeaderElection.java:393) > at > > org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.run(FastLeaderElection.java:365) > at java.lang.Thread.run(Thread.java:724) > > Thanks & Regards > Dhamodharan Ramalingam > > > > From: Ted Yu > To: "user@hbase.apache.org" > Date: 11/25/2014 08:28 PM > Subject: Re: Zookeepr shuting down. > > > > Can you pastebin zookeeper log from the master node ? > > What hbase / zookeeper version are you using ? > > Cheers > > On Mon, Nov 24, 2014 at 11:09 PM, wrote: > > > Hi > > > > I have a cluster of 3 systems each of 32GB RAM and 1 TB HD. I have > > clustered all the three and able to start and run Hadoop Successfully. > > > > I have installed Hbase on the master node. Now am trying to start > > Zookeeper in the cluster. When I start zookeeper and give command > > ./zkServer.sh status its telling 'Zookeeper might not be running... '. > > When I start zookeeper on all Nodes, zookeper on Master node becomes > > follower and when an MR is run , It will run 90% on reducer part then > > throws error 'Not able to find the region server.'. When checked > zookeeper > > logs, Am viewing 'Connection error'. > > > > There is proper ssh across all 3 systems. I have tried giving iptables > > -flush too, but no luck!! > > > > I have tried another option by allowing Hbase to handle its own > zookeeper. > > HQuarampeer starts successfully, and I am able to process 3GB file > through > > MR for first time. But from second run zookeeper on one of the nodes > > getting killed. not able to figure out the issue. > > > > Please help!!! > > > > Thanks & Regards > > Dhamodharan > > > > =====-----=====-----===== > > Notice: The information contained in this e-mail > > message and/or attachments to it may contain > > confidential or privileged information. If you are > > not the intended recipient, any dissemination, use, > > review, distribution, printing or copying of the > > information contained in this e-mail message > > and/or attachments to it are strictly prohibited. If > > you have received this communication in error, > > please notify us by reply e-mail or telephone and > > immediately and permanently delete the message > > and any attachments. Thank you > > > > > > > > --001a11c22f9c1c098c0508d84fe8--