From: David Nickerson
To: user@zookeeper.apache.org
Date: Wed, 11 Jul 2012 19:25:06 -0400
Subject: Re: ZK not starting up due to socket timeouts

For cross-reference, the leap second bug's effect on ZooKeeper has been
posted about before:
http://zookeeper-user.578899.n2.nabble.com/leap-second-excitement-td7577634.html

On Wed, Jul 11, 2012 at 6:31 PM, Narayanan A R <narayanan.arunachalam@gmail.com> wrote:

> Fixed the problem. It had to do with the leap second bug. I ran the following
> command on all three servers and it is now working fine.
>
> date;/etc/init.d/ntp stop;/etc/init.d/ntpd stop; date `date +"%m%d%H%M%C%y.%S"`;/etc/init.d/ntp start;/etc/init.d/ntpd start
>
> You can find an overview of this problem here:
>
> http://www.wired.com/wiredenterprise/2012/07/leap-second-bug-wreaks-havoc-with-java-linux/
>
>
> On Wed, Jul 11, 2012 at 2:35 PM, Narayanan A R <narayanan.arunachalam@gmail.com> wrote:
>
> > Thanks David. Here they are:
> >
> > http://pastebin.com/STTnLf9s
> > http://pastebin.com/PiTgUWpA
> > http://pastebin.com/4V3AjT34
> >
> >
> > On Wed, Jul 11, 2012 at 4:32 AM, David Nickerson <davidnickerson4mailinglists@gmail.com> wrote:
> >
> >> Narayanan, I don't think the attachments made it through. Can you link to
> >> the logs in Pastebin?
> >>
> >> On Tue, Jul 10, 2012 at 5:24 PM, Narayanan A R <narayanan.arunachalam@gmail.com> wrote:
> >>
> >> > Yeah, I tried that. Right now I have that set to 2 mins.
> >> >
> >> > On Tue, Jul 10, 2012 at 1:47 PM, Jordan Zimmerman <jordan@jordanzimmerman.com> wrote:
> >> >
> >> > > Another thing you might need to do is to increase initLimit and syncLimit.
> >> > > It might be timing out when it's syncing.
> >> > >
> >> > > -JZ
> >> > >
> >> > > On Jul 10, 2012, at 1:46 PM, Narayanan A R wrote:
> >> > >
> >> > > > I have 3 servers in the cluster, all bare metal machines, with
> >> > > > ZK 3.4.3 installed. The following is the config on all the servers.
> >> > > >
> >> > > > tickTime=2000
> >> > > > dataDir=/opt/zookeeper-3.4.3/data
> >> > > > clientPort=2181
> >> > > > initLimit=60
> >> > > > syncLimit=60
> >> > > > server.1=10.7.78.77:2888:3888
> >> > > > server.2=10.7.66.54:2888:3888
> >> > > > server.3=10.7.56.61:2888:3888
> >> > > >
> >> > > > When I start up the instances, I see socket timeouts in all the
> >> > > > instances. I have attached logs from all three machines.
> >> > > >
> >> > > > Regards,
> >> > > > ARN
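
A note on the "2 mins" figure quoted in the thread: initLimit and syncLimit are counted in ticks, and tickTime is in milliseconds, so the config shown works out to 60 * 2000 ms = 120 seconds for each limit. The sketch below just does that arithmetic on the quoted settings. The helper names (parse_zoo_cfg, limit_seconds) are made up for illustration and are not part of ZooKeeper; the server.N lines are left out because they don't affect the calculation.

```python
def parse_zoo_cfg(text):
    """Parse key=value lines from a zoo.cfg-style string into a dict.

    Hypothetical helper for illustration; not a ZooKeeper API.
    """
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


def limit_seconds(settings, name):
    """Convert a tick-based limit (initLimit or syncLimit) to seconds."""
    tick_ms = int(settings["tickTime"])  # tickTime is in milliseconds
    return int(settings[name]) * tick_ms / 1000.0


# Settings copied from the config quoted in the thread.
ZOO_CFG = """\
tickTime=2000
dataDir=/opt/zookeeper-3.4.3/data
clientPort=2181
initLimit=60
syncLimit=60
"""

settings = parse_zoo_cfg(ZOO_CFG)
init_timeout_s = limit_seconds(settings, "initLimit")
sync_timeout_s = limit_seconds(settings, "syncLimit")
print("initLimit timeout (s):", init_timeout_s)
print("syncLimit timeout (s):", sync_timeout_s)
```

Since both limits were already at two minutes, raising them further was unlikely to help, which fits with the cause turning out to be the leap second bug rather than slow syncing.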