Return-Path: X-Original-To: apmail-zookeeper-user-archive@www.apache.org Delivered-To: apmail-zookeeper-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6DB26108AA for ; Sat, 9 Nov 2013 20:11:48 +0000 (UTC) Received: (qmail 98983 invoked by uid 500); 9 Nov 2013 20:11:47 -0000 Delivered-To: apmail-zookeeper-user-archive@zookeeper.apache.org Received: (qmail 98948 invoked by uid 500); 9 Nov 2013 20:11:47 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 98940 invoked by uid 99); 9 Nov 2013 20:11:47 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 09 Nov 2013 20:11:47 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of zkquestions@gmail.com designates 209.85.219.67 as permitted sender) Received: from [209.85.219.67] (HELO mail-oa0-f67.google.com) (209.85.219.67) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 09 Nov 2013 20:11:43 +0000 Received: by mail-oa0-f67.google.com with SMTP id j17so933004oag.10 for ; Sat, 09 Nov 2013 12:11:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=B76ozB9xmXaTY+OJMPeyUMkbme1hwkwsCmGsbFeXIjI=; b=j3CuN681is1/7dYv1qIyc6hDvVYsMn0TL+DyV8g58Cslp54iCkeOgaBF0Ui5QAzxwy bJuJ5O40zQSAWJYd46SCPZtEHzC2iEz5tv7e940Y+0QEk28aOLzuuXzPPcWLLniz6KcQ DL4Ly3niAmEmUyHclJsdFaqc/LG/S53H6MVh5QGi0xfvUf1FpDpMA+aa1QyW0sXPRRK4 yOEamDapsDiaFcbMKKNIFgVoZTly12jbDApoNT/ae/MtaOgN/AD+hP1wlS8+1CIjQS/L nfTkKMfOwItRins6MNAJ/APunItkFnYCG4zfI0icVUV1YztKr6EwkETvjZxipBgI+nOx RehQ== MIME-Version: 1.0 X-Received: by 10.182.66.164 with SMTP id g4mr10531244obt.47.1384027882275; Sat, 09 Nov 2013 12:11:22 -0800 (PST) Received: by 10.60.55.69 with HTTP; Sat, 9 Nov 2013 12:11:22 -0800 (PST) In-Reply-To: References: Date: Sat, 9 Nov 2013 12:11:22 -0800 Message-ID: Subject: Re: Problem recovering from a bad reconfig (3.5) From: zk questions To: user@zookeeper.apache.org Content-Type: multipart/alternative; boundary=089e0160c35e48473304eac41a81 X-Virus-Checked: Checked by ClamAV on apache.org --089e0160c35e48473304eac41a81 Content-Type: text/plain; charset=ISO-8859-1 Just realized that attachments don't go through, here it is linked instead: https://docs.google.com/file/d/0B3K7QIlpXfSXdl92aHhxUXVDRkk/edit?usp=drive_web On Sat, Nov 9, 2013 at 10:59 AM, zk questions wrote: > Hi, > > I've been testing out the dynamic reconfig feature of 3.5 along with using > this patch (https://issues.apache.org/jira/browse/ZOOKEEPER-1691) and I'm > having an issue where my zk cluster won't allow me to perform further > reconfigs. > So here's what I'm doing: > 1) Start nodes 1 and 2 > 2) Invoke reconfig on 1 to add 2; this suceeds > 3) Start node 3 with the initial configuration with the dynamic config set > to just 2 and 3, where 2 isn't a leader (manually verified) > 4) Invoke reconfig on 2 to add 3; this fails, with an error indicating > that another reconfig in progress > 5) Then I restart 3 with the configuration containing just 1 and 3 > 6) Then I try again to add 3 to the cluster by invoking reconfig on 1 to > add 3; and again I see an error indicating that another reconfig is in > progress > > FWIW: I'm testing this scenario to simulate the situation where I'm > automating the reconfig process and the dynamic configuration for 3 ends up > containing a node that isn't the leader. > > I was wondering what I should do in this situation to recover from the > failure at step 3 so that we can fix the dynamic config and then attempt a > proper reconfig (steps 4 - 6)? > > I've also attached a tar containing a script to automatically reproduce > the steps and problem I'm seeing above. > > Thanks. > --089e0160c35e48473304eac41a81--