Subject: Re: Zookeeper ensemble backup questions?
From: Jordan Zimmerman
Date: Fri, 19 Jul 2013 11:13:38 -0700
Message-Id: <7527D9F1-F9A5-44C7-9739-790ECBE138CE@jordanzimmerman.com>
To: user@zookeeper.apache.org

I'm a bit late to this one, but I don't understand the need for a complete
backup of ZK data. In my experience, 99% of ZNodes are ephemeral, so it
would be wrong to restore those nodes. In a disaster, the client sessions
would expire and you would not want ephemeral nodes restored. This is why
in Exhibitor I took the approach of selective restore.

-Jordan

On Jul 19, 2013, at 11:00 AM, jack ma wrote:

> I asked these questions in the thread
> http://mail-archives.apache.org/mod_mbox/zookeeper-user/201307.mbox/%3cCAB+cfdwhOV0JfB04=MpO_+i-4ou=VbL=EG2XS557+j+698jx3A@mail.gmail.com%3e,
> but there was no response.
>
> So I am posting the questions again here, hoping to get help
> from the community.
>
> I want to make sure I fully understand the procedures for ZooKeeper
> backup and disaster recovery:
>
> For the backup procedure on a ZooKeeper ensemble:
> (1) Log in to any host whose state is "Serving".
>     Question:
>     Do I have to log in to the leader node, or is any node OK?
> (2) Copy the latest snapshot file and transaction log from the version-2
>     directory.
>     Question:
>     How do we make sure we do not copy corrupt files if the
>     snapshot/transaction log is in the middle of an update? Do we have
>     to shut down the node to make the copy?
>     Besides the transaction log and snapshot, do we have to
>     copy other files, such as the epoch files?
>
> For the disaster recovery procedure on a ZooKeeper ensemble:
> (1) Recreate the machines for the ZooKeeper ensemble.
> (2) Copy the snapshot/transaction log we backed up into the ZooKeeper
>     dataDir\version-2 and logDir\version-2.
>     Question:
>     Do we have to copy the epoch files?
>     Do we have to copy the backed-up snapshot/transaction log to
>     all the ZooKeeper nodes, or just the first node we start?
>
> Appreciate your time and help.
> Jack
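
For backup step (2) in the quoted message, here is a minimal sketch of how one might pick the files to copy. It is not Exhibitor's approach and not an official ZooKeeper tool. It assumes the usual file naming in the version-2 directories (`snapshot.<hex zxid>` and `log.<hex zxid>`) and assumes that the logs needed alongside a snapshot start at the newest log whose zxid is not greater than the snapshot's. The function names `select_backup_files` and `backup` are made up for illustration. It does not deal with a file that is still being written, and it ignores the epoch files.

```python
import re
import shutil
from pathlib import Path

# Assumed naming: "snapshot.<hex zxid>" and "log.<hex zxid>".
_NAME = re.compile(r"^(snapshot|log)\.([0-9a-fA-F]+)$")


def _zxid(path, kind):
    """Return the zxid encoded in the file name, or None if it does not match."""
    m = _NAME.match(path.name)
    if m and m.group(1) == kind:
        return int(m.group(2), 16)
    return None


def select_backup_files(snap_dir, log_dir):
    """Pick the latest snapshot and the transaction logs that go with it.

    snap_dir and log_dir are the version-2 directories (they may be the same).
    """
    snaps = [(z, p) for p in Path(snap_dir).iterdir()
             if (z := _zxid(p, "snapshot")) is not None]
    if not snaps:
        raise FileNotFoundError(f"no snapshot files in {snap_dir}")
    snap_zxid, snapshot = max(snaps, key=lambda t: t[0])

    logs = sorted(((z, p) for p in Path(log_dir).iterdir()
                   if (z := _zxid(p, "log")) is not None),
                  key=lambda t: t[0])
    # Assumption: a log file is named after the first zxid it holds, so the
    # newest log that starts at or before the snapshot may still be needed.
    older = [z for z, _ in logs if z <= snap_zxid]
    start = max(older) if older else 0
    return snapshot, [p for z, p in logs if z >= start]


def backup(snap_dir, log_dir, dest):
    """Copy the selected files into dest and return the list of copied names."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    snapshot, logs = select_backup_files(snap_dir, log_dir)
    copied = []
    for src in [snapshot, *logs]:
        shutil.copy2(src, dest / src.name)
        copied.append(src.name)
    return copied
```

For example, `backup("/var/zookeeper/version-2", "/var/zookeeper/version-2", "/backups/zk")` would copy the newest snapshot plus the matching logs; the paths here are placeholders. Given the point above about ephemeral nodes, a file-level copy like this restores everything in the snapshot, so it is a different trade-off from selective restore.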