Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 75997200BA7 for ; Fri, 21 Oct 2016 15:59:48 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 74481160AE8; Fri, 21 Oct 2016 13:59:48 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id BA46C160AE0 for ; Fri, 21 Oct 2016 15:59:47 +0200 (CEST) Received: (qmail 9256 invoked by uid 500); 21 Oct 2016 13:59:46 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 9241 invoked by uid 99); 21 Oct 2016 13:59:46 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 Oct 2016 13:59:46 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 04E17C0118 for ; Fri, 21 Oct 2016 13:59:46 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.02 X-Spam-Level: X-Spam-Status: No, score=-0.02 tagged_above=-999 required=6.31 tests=[HEADER_FROM_DIFFERENT_DOMAINS=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id pqmAaCmxBNwt for ; Fri, 21 Oct 2016 13:59:44 +0000 (UTC) Received: from mail-lf0-f47.google.com (mail-lf0-f47.google.com [209.85.215.47]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 7F8B15F39A for ; Fri, 21 Oct 2016 13:59:43 +0000 (UTC) Received: by mail-lf0-f47.google.com with SMTP id l131so139856692lfl.2 for ; Fri, 21 Oct 2016 06:59:43 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; bh=gEeSqAtwoCzwuAH9459fuyTMcFFy4pTTda1enWNgv/M=; b=K4uP1dBGCipHh/S6Fnz9mA5A3f7mGMzXLypSyYV749hc6NJt9aNVpPJ/aAOoxMY846 rFW6m5UsnIHEANlNLLPqWsB82HZ2Gc6pf92YPBrhFBZHtpiCc8b/2GQoO8rK2wULSbrI 65PvdJXGtzWVwuih7FD5Is95vc9M6x376d2lWNfbtiehaZ8RcWu0hwddvVhgq/03OSi8 0XR3gj+51V6ZQlZ6uMh3Y1US3a8wj75hPtLHLjTmLdT4lhrKN8mNN69kSIlNhd3TTs5/ 831Et2uj9a3f1AUppg8Vd+JF6AKpVRlwNDTUX9b2vk3+zS56E12vk2mee06vrNOy3i27 jb1g== X-Gm-Message-State: AA6/9RlloeM7zJqnepnIee74m3xa95Cp02nVg96i8J0oUP7k7OZNeKTmcRHVULMElJJ5tA== X-Received: by 10.28.183.213 with SMTP id h204mr10094749wmf.87.1477058380746; Fri, 21 Oct 2016 06:59:40 -0700 (PDT) Received: from fpj-mac.home (host86-142-23-199.range86-142.btcentralplus.com. [86.142.23.199]) by smtp.gmail.com with ESMTPSA id r63sm4353380wmb.24.2016.10.21.06.59.39 for (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 21 Oct 2016 06:59:40 -0700 (PDT) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2104\)) Subject: Re: rolling upgrade with wiped disk From: Flavio Junqueira In-Reply-To: <30894FD248BF284589A13D667130AD5A76F8ADD2@MX106CL01.corp.emc.com> Date: Fri, 21 Oct 2016 14:59:38 +0100 Content-Transfer-Encoding: quoted-printable Message-Id: <1C9568CC-491D-40B1-9A4C-5449C7231D99@apache.org> References: <30894FD248BF284589A13D667130AD5A6E682A7C@MX106CL01.corp.emc.com> <2430F7F3-559C-4FE2-93B0-1878D4908AA5@apache.org> <30894FD248BF284589A13D667130AD5A76F8ADD2@MX106CL01.corp.emc.com> To: user@zookeeper.apache.org X-Mailer: Apple Mail (2.2104) archived-at: Fri, 21 Oct 2016 13:59:48 -0000 > On 21 Oct 2016, at 14:35, Vazhenin, Maksim = wrote: >=20 > Thanks for reply, Flavio, >=20 > "You have to be careful when you do this. You may end up losing quorum = on txns if you wipe out the disk of a server prematurely." > What steps could be performed to not loose quorum? If quorum is lost = what steps are to be performed to fix it? > Is there a chance to loose data stored in zk in this scenario? If you wipe out the next server before a quorum is formed, then you = could end up losing transactions. Making sure that there is a leader and = enough followers before the next iteration is very critical. > On node failure / node replacement it is inevitable to have disk wiped = for one server (this should be handled by zk as this will happen = eventually on all systems running zk). > Does upgrading zk version on node with wiped disk adds a situation not = supported by zk? >=20 > "On your step 4, you can tell that a server is ready once clients are = able to connect to it successfully." > Do you mean client should have only this server in the connection = string, and once this client get respond for any 'get' operation it = means that it is safe to go ahead? That's a way of testing, yes. A way of checking this manually is to use = the zkCli under bin with the address of the server that you're bringing = up. If you want to automate it, then you probably want some java code = that waits until a session is sync connected, and the only address in = the connect string is the one of the server you're interested in. Thanks, -Flavio=20 >=20 > Thanks, > Maksim >=20 > -----Original Message----- > From: Flavio Junqueira [mailto:fpj@apache.org]=20 > Sent: Friday, October 21, 2016 3:59 PM > To: user@zookeeper.apache.org > Subject: Re: rolling upgrade with wiped disk >=20 > Hi Maksim, >=20 > You have to be careful when you do this. You may end up losing quorum = on txns if you wipe out the disk of a server prematurely. >=20 > On your step 4, you can tell that a server is ready once clients are = able to connect to it successfully. >=20 > Thanks, > -Flavio >=20 >> On 21 Oct 2016, at 09:11, Vazhenin, Maksim = wrote: >>=20 >> Hi, >>=20 >> Is this a supported scenario for doing rolling upgrade of zookeeper = (v3.4.5) to later version (say v3.4.9): >>=20 >> 1. Shutdown server A (v3.4.5) >> 2. Wipe disk with zookeeper data on server A 3. Start server A with=20= >> new zk version (v3.4.9) 4. Wait till reconstruction complete on = server=20 >> A (what is the indicator for completion?) 5. Go to server B and = repeat=20 >> 1-4. >>=20 >> Thanks, >> Maksim >=20