From: Andor Molnar
Date: Fri, 25 Jan 2019 13:14:07 +0100
Subject: Re: [**SPAM**] RE: ZK Server does not join quorum after restart
To: user@zookeeper.apache.org

Hi Ian,

Would you please attach logs from all participants of the ensemble or try
to find an exception from when the follower is trying to join?

Regards,
Andor

On Fri, Jan 25, 2019 at 1:37 AM Ian Spence wrote:

> Hi Daniel,
>
> Thanks for the quick reply. We use static IP addresses on all of the
> servers so it did not change after the reboot.
>
> Thanks,
> -Ian
>
> From: Daniel Chan on behalf of Daniel Chan <daniel.cw.chan@oracle.com>
> Reply-To: "user@zookeeper.apache.org"
> Date: Thursday, January 24, 2019 at 16:36
> To: "user@zookeeper.apache.org"
> Subject: [**SPAM**] RE: ZK Server does not join quorum after restart
>
> If its IP address got changed, then you hit a known bug
> https://issues.apache.org/jira/browse/ZOOKEEPER-1506 and you need to
> bounce the cluster.
>
> Thanks,
> Daniel
>
> -----Original Message-----
> From: Ian Spence <Ian.Spence@globalrelay.net>
> Sent: Thursday, January 24, 2019 2:36 PM
> To: user@zookeeper.apache.org
> Subject: ZK Server does not join quorum after restart
>
> Hello
>
> We have a cluster of 5 ZK servers, all running ZK 3.4.6 on Java 1.8 on
> CentOS 6. These are physical devices, not virtual machines.
>
> One server required hardware maintenance, and was restarted. When the zk
> software was restarted, it did not rejoin the quorum as a follower.
>
> Running “stat” or “mntr” commands returns: “This ZooKeeper instance is not
> currently serving requests”
>
> I googled this message and came across this bug:
> https://urldefense.proofpoint.com/v2/url?u=https-3A__issues.apache.org_jira_browse_ZOOKEEPER-2D2164&d=DwIGaQ&c=RoP1YumCXCgaWHvlZYR8PZh8Bv7qIrMUB65eapI_JnE&r=JE3yjNS4hXa8nS9n2uFCwEqMvv18hzzEnqunUhCoEns&m=S_8TazqwUbEfRtAYQCn8kA7F2tiGUBaVr3c_nj0Fh8A&s=FGIs9YOjwdYrzBH8om70Jx11KemHKRDsMY_kZK6cpK0&e=
>
> Does anybody know if there is a work-around to this issue? We’ve seen this
> problem multiple times in the past and our current solution is to bring
> down the zk cluster (which is a huge outage-causing pain).
>
> Thanks
>
> - Ian
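
Note (not part of the original thread): the "stat" and "mntr" checks mentioned in the quoted message can be scripted, so that the state of every ensemble member is collected in one pass alongside the logs. The following is a minimal Python sketch under stated assumptions: the servers accept four-letter commands on the client port (2181 is assumed here), the hostnames in the usage comment are placeholders, and the helper names (four_letter_word, classify, check_ensemble) are hypothetical, not part of any ZooKeeper client library.

```python
import socket

# Reply text quoted in the thread for a server that has not joined the quorum.
NOT_SERVING = "This ZooKeeper instance is not currently serving requests"


def four_letter_word(host, port, command, timeout=5.0):
    """Send a four-letter command (e.g. 'stat' or 'mntr') and return the reply text."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command.encode("ascii"))
        chunks = []
        # The server writes its reply and then closes the connection.
        while True:
            data = sock.recv(4096)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def classify(reply):
    """Return 'not serving', the reported mode (leader/follower/...), or 'unknown'."""
    if NOT_SERVING in reply:
        return "not serving"
    for line in reply.splitlines():
        # 'stat' replies contain a line such as "Mode: follower".
        if line.startswith("Mode:"):
            return line.split(":", 1)[1].strip()
        # 'mntr' replies contain a line such as "zk_server_state<TAB>follower".
        if line.startswith("zk_server_state"):
            parts = line.split(None, 1)
            if len(parts) == 2:
                return parts[1].strip()
    return "unknown"


def check_ensemble(servers, command="stat"):
    """Query every (host, port) pair and map 'host:port' to its classified state."""
    results = {}
    for host, port in servers:
        key = f"{host}:{port}"
        try:
            results[key] = classify(four_letter_word(host, port, command))
        except OSError as exc:
            # Covers refused connections, timeouts, and name-resolution failures.
            results[key] = f"unreachable ({exc})"
    return results


# Example usage (placeholder hostnames, assumed port):
#   servers = [(f"zk{i}.example.com", 2181) for i in range(1, 6)]
#   for server, state in check_ensemble(servers).items():
#       print(server, state)
```

A server stuck in the state described above would show up as "not serving", while healthy members report "leader" or "follower". This only identifies which member is affected; the cause still has to come from the server logs.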