Return-Path: X-Original-To: apmail-curator-dev-archive@minotaur.apache.org Delivered-To: apmail-curator-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6942910233 for ; Fri, 12 Jul 2013 22:12:16 +0000 (UTC) Received: (qmail 20870 invoked by uid 500); 12 Jul 2013 22:12:16 -0000 Delivered-To: apmail-curator-dev-archive@curator.apache.org Received: (qmail 20838 invoked by uid 500); 12 Jul 2013 22:12:16 -0000 Mailing-List: contact dev-help@curator.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@curator.incubator.apache.org Delivered-To: mailing list dev@curator.incubator.apache.org Received: (qmail 20830 invoked by uid 99); 12 Jul 2013 22:12:16 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Jul 2013 22:12:16 +0000 X-ASF-Spam-Status: No, hits=-2000.3 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.3] (HELO mail.apache.org) (140.211.11.3) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 12 Jul 2013 22:12:14 +0000 Received: (qmail 18917 invoked by uid 99); 12 Jul 2013 22:11:53 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 12 Jul 2013 22:11:53 +0000 Date: Fri, 12 Jul 2013 22:11:53 +0000 (UTC) From: "Jordan Zimmerman (JIRA)" To: dev@curator.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (CURATOR-47) Two leaders in the leader election recipe MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/CURATOR-47?page=3Dcom.atlassian= .jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D1370= 7449#comment-13707449 ]=20 Jordan Zimmerman edited comment on CURATOR-47 at 7/12/13 10:11 PM: ------------------------------------------------------------------- I ran the test for a while (without the InstanceSpec changes) and was not a= ble to reproduce (see log below). If you post your InstanceSpec changes I'l= l try those. BTW, I recently started an email thread on user@ZK titled "rol= ling config change considered harmful". Please have a look: http://search-hadoop.com/m/6hmDl1xZTxY1&subj=3DRe+Rolling+config+change+con= sidered+harmful+ t002043: Starting test. t002057: Stopping server 0 iteration 0 t002905: 0 takes leadership. Leader was:-1 t006030: Stopping server 1 iteration 1 t006144: 0 looses leadership. t007507: 1 takes leadership. Leader was:-1 t011064: Stopping server 2 iteration 2 t011178: 1 looses leadership. t014713: 0 takes leadership. Leader was:-1 t015817: Stopping server 0 iteration 3 t015922: 0 looses leadership. t016572: 1 takes leadership. Leader was:-1 t020794: Stopping server 1 iteration 4 t020899: 1 looses leadership. t022001: 0 takes leadership. Leader was:-1 t025944: Stopping server 2 iteration 5 t026060: 0 looses leadership. t029048: 1 takes leadership. Leader was:-1 t031184: Stopping server 0 iteration 6 t031289: 1 looses leadership. t031796: 0 takes leadership. Leader was:-1 t035777: Stopping server 1 iteration 7 t035892: 0 looses leadership. t037935: 1 takes leadership. Leader was:-1 t040831: Stopping server 2 iteration 8 t040954: 1 looses leadership. t044511: 0 takes leadership. Leader was:-1 t046298: Stopping server 0 iteration 9 t046402: 0 looses leadership. t047248: 1 takes leadership. Leader was:-1 t051618: Stopping server 1 iteration 10 t051721: 1 looses leadership. t052597: 0 takes leadership. Leader was:-1 t057122: Stopping server 2 iteration 11 t057239: 0 looses leadership. t062421: Stopping server 0 iteration 12 t066103: 0 takes leadership. Leader was:-1 t067015: Stopping server 1 iteration 13 t067120: 0 looses leadership. t070908: 0 takes leadership. Leader was:-1 t072351: Stopping server 2 iteration 14 t072465: 0 looses leadership. t074737: 0 takes leadership. Leader was:-1 t077254: Stopping server 0 iteration 15 t077356: 0 looses leadership. t077564: 0 takes leadership. Leader was:-1 t082598: Stopping server 1 iteration 16 t082709: 0 looses leadership. t083460: 0 takes leadership. Leader was:-1 t088007: Stopping server 2 iteration 17 t088122: 0 looses leadership. t089385: 0 takes leadership. Leader was:-1 t093239: Stopping server 0 iteration 18 t093342: 0 looses leadership. t093599: 0 takes leadership. Leader was:-1 t097752: Stopping server 1 iteration 19 t097862: 0 looses leadership. t099856: 0 takes leadership. Leader was:-1 t103008: Stopping server 2 iteration 20 t103122: 0 looses leadership. t104398: 0 takes leadership. Leader was:-1 t108164: Stopping server 0 iteration 21 t108266: 0 looses leadership. t108842: 0 takes leadership. Leader was:-1 t112872: Stopping server 1 iteration 22 t112975: 0 looses leadership. t114180: 0 takes leadership. Leader was:-1 t118172: Stopping server 2 iteration 23 t118288: 0 looses leadership. t120987: 0 takes leadership. Leader was:-1 t123105: Stopping server 0 iteration 24 t123206: 0 looses leadership. t124172: 0 takes leadership. Leader was:-1 t128560: Stopping server 1 iteration 25 t128667: 0 looses leadership. t131217: 0 takes leadership. Leader was:-1 t133301: Stopping server 2 iteration 26 t133415: 0 looses leadership. t135576: 0 takes leadership. Leader was:-1 t138116: Stopping server 0 iteration 27 t138219: 0 looses leadership. t138493: 0 takes leadership. Leader was:-1 t143080: Stopping server 1 iteration 28 t143193: 0 looses leadership. =20 was (Author: randgalt): I ran the test for a while (without the InstanceSpec changes) and was n= ot able to reproduce (see log below). If you post your InstanceSpec changes= I'll try those. BTW, I recently started an email thread on user@ZK titled = "rolling config change considered harmful". Please have a look: http://qnalist.com/questions/3983279/rolling-config-change-considered-harmf= ul t002043: Starting test. t002057: Stopping server 0 iteration 0 t002905: 0 takes leadership. Leader was:-1 t006030: Stopping server 1 iteration 1 t006144: 0 looses leadership. t007507: 1 takes leadership. Leader was:-1 t011064: Stopping server 2 iteration 2 t011178: 1 looses leadership. t014713: 0 takes leadership. Leader was:-1 t015817: Stopping server 0 iteration 3 t015922: 0 looses leadership. t016572: 1 takes leadership. Leader was:-1 t020794: Stopping server 1 iteration 4 t020899: 1 looses leadership. t022001: 0 takes leadership. Leader was:-1 t025944: Stopping server 2 iteration 5 t026060: 0 looses leadership. t029048: 1 takes leadership. Leader was:-1 t031184: Stopping server 0 iteration 6 t031289: 1 looses leadership. t031796: 0 takes leadership. Leader was:-1 t035777: Stopping server 1 iteration 7 t035892: 0 looses leadership. t037935: 1 takes leadership. Leader was:-1 t040831: Stopping server 2 iteration 8 t040954: 1 looses leadership. t044511: 0 takes leadership. Leader was:-1 t046298: Stopping server 0 iteration 9 t046402: 0 looses leadership. t047248: 1 takes leadership. Leader was:-1 t051618: Stopping server 1 iteration 10 t051721: 1 looses leadership. t052597: 0 takes leadership. Leader was:-1 t057122: Stopping server 2 iteration 11 t057239: 0 looses leadership. t062421: Stopping server 0 iteration 12 t066103: 0 takes leadership. Leader was:-1 t067015: Stopping server 1 iteration 13 t067120: 0 looses leadership. t070908: 0 takes leadership. Leader was:-1 t072351: Stopping server 2 iteration 14 t072465: 0 looses leadership. t074737: 0 takes leadership. Leader was:-1 t077254: Stopping server 0 iteration 15 t077356: 0 looses leadership. t077564: 0 takes leadership. Leader was:-1 t082598: Stopping server 1 iteration 16 t082709: 0 looses leadership. t083460: 0 takes leadership. Leader was:-1 t088007: Stopping server 2 iteration 17 t088122: 0 looses leadership. t089385: 0 takes leadership. Leader was:-1 t093239: Stopping server 0 iteration 18 t093342: 0 looses leadership. t093599: 0 takes leadership. Leader was:-1 t097752: Stopping server 1 iteration 19 t097862: 0 looses leadership. t099856: 0 takes leadership. Leader was:-1 t103008: Stopping server 2 iteration 20 t103122: 0 looses leadership. t104398: 0 takes leadership. Leader was:-1 t108164: Stopping server 0 iteration 21 t108266: 0 looses leadership. t108842: 0 takes leadership. Leader was:-1 t112872: Stopping server 1 iteration 22 t112975: 0 looses leadership. t114180: 0 takes leadership. Leader was:-1 t118172: Stopping server 2 iteration 23 t118288: 0 looses leadership. t120987: 0 takes leadership. Leader was:-1 t123105: Stopping server 0 iteration 24 t123206: 0 looses leadership. t124172: 0 takes leadership. Leader was:-1 t128560: Stopping server 1 iteration 25 t128667: 0 looses leadership. t131217: 0 takes leadership. Leader was:-1 t133301: Stopping server 2 iteration 26 t133415: 0 looses leadership. t135576: 0 takes leadership. Leader was:-1 t138116: Stopping server 0 iteration 27 t138219: 0 looses leadership. t138493: 0 takes leadership. Leader was:-1 t143080: Stopping server 1 iteration 28 t143193: 0 looses leadership. =20 > Two leaders in the leader election recipe > ----------------------------------------- > > Key: CURATOR-47 > URL: https://issues.apache.org/jira/browse/CURATOR-47 > Project: Apache Curator > Issue Type: Bug > Components: Recipes > Affects Versions: 2.1.0-incubating > Environment: Windows and Java 1.7 > Reporter: Germ=C3=A1n Blanco > Fix For: TBD > > Attachments: TestLeaderElection.java > > > When using the Leader Election recipe and doing a rolling restart of the = zookeeper servers in the cluster, it seems that at some points there are tw= o participants that see themselves as leaders. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrato= rs For more information on JIRA, see: http://www.atlassian.com/software/jira