Return-Path: X-Original-To: apmail-zookeeper-user-archive@www.apache.org Delivered-To: apmail-zookeeper-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 452CD9548 for ; Sat, 4 Feb 2012 00:02:03 +0000 (UTC) Received: (qmail 25357 invoked by uid 500); 4 Feb 2012 00:02:02 -0000 Delivered-To: apmail-zookeeper-user-archive@zookeeper.apache.org Received: (qmail 25305 invoked by uid 500); 4 Feb 2012 00:02:02 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 25287 invoked by uid 99); 4 Feb 2012 00:02:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 04 Feb 2012 00:02:01 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [216.82.250.83] (HELO mail120.messagelabs.com) (216.82.250.83) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 04 Feb 2012 00:01:51 +0000 X-Env-Sender: jh033q@att.com X-Msg-Ref: server-7.tower-120.messagelabs.com!1328313689!62008329!1 X-Originating-IP: [144.160.112.28] X-StarScan-Version: 6.5.5; banners=-,-,- X-VirusChecked: Checked Received: (qmail 9102 invoked from network); 4 Feb 2012 00:01:29 -0000 Received: from sbcsmtp3.sbc.com (HELO tlpi048.enaf.dadc.sbc.com) (144.160.112.28) by server-7.tower-120.messagelabs.com with DHE-RSA-AES256-SHA encrypted SMTP; 4 Feb 2012 00:01:29 -0000 Received: from enaf.dadc.sbc.com (localhost.localdomain [127.0.0.1]) by tlpi048.enaf.dadc.sbc.com (8.14.5/8.14.5) with ESMTP id q1401SsI020189 for ; Fri, 3 Feb 2012 18:01:29 -0600 Received: from dalint01.pst.cso.att.com (dalint01.pst.cso.att.com [135.31.133.159]) by tlpi048.enaf.dadc.sbc.com (8.14.5/8.14.5) with ESMTP id q1401P1W020096 for ; Fri, 3 Feb 2012 18:01:25 -0600 Received: from clpd033.sldc.sbc.com (clpd033.sldc.sbc.com [132.201.211.55]) by dalint01.pst.cso.att.com (RSA Interceptor) for ; Fri, 3 Feb 2012 18:01:09 -0600 Received: from sldc.sbc.com (localhost.localdomain [127.0.0.1]) by clpd033.sldc.sbc.com (8.14.4/8.14.4) with ESMTP id q14019Tg004293 for ; Fri, 3 Feb 2012 18:01:09 -0600 Received: from mxr-sna-0101.corp.wayport.net (mxr-sna-0101.corp.wayport.net [216.12.234.2]) by clpd033.sldc.sbc.com (8.14.4/8.14.4) with ESMTP id q14016IN004180 for ; Fri, 3 Feb 2012 18:01:06 -0600 Received: from exc-aus-0001.wayad.corp.wayport.net ([216.12.225.147]) by mxr-sna-0101.corp.wayport.net with ESMTP/TLS/AES128-SHA; 04 Feb 2012 00:01:06 +0000 Received: from exc-aus-0001.wayad.corp.wayport.net ([216.12.225.147]) by exc-aus-0001 ([216.12.225.147]) with mapi; Fri, 3 Feb 2012 18:01:05 -0600 From: Jason Harmon To: "user@zookeeper.apache.org" Date: Fri, 3 Feb 2012 18:01:03 -0600 Subject: RE: Deployment planning question Thread-Topic: Deployment planning question Thread-Index: Acziv+jfacia8II+T+qulo06RhU5nAADhNsQ Message-ID: <8E4F1F51FEC5C54B9FCA172E165FA17906F660FA96@exc-aus-0001> References: <8E4F1F51FEC5C54B9FCA172E165FA17906F660FA07@exc-aus-0001> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-RSA-Inspected: yes X-RSA-Classifications: public X-RSA-Action: allow X-Virus-Checked: Checked by ClamAV on apache.org My preference, of course, would be to have three datacenters...if one is pa= rtitioned off, zookeeper would not respond, which would be perfect. In that= scenario, our other two would still be up and running in the other two dat= acenters. Of course I only have two, and I can't host anything externally due to corp= orate security restrictions. It sure would be nice for a smaller-scale depl= oyment to just have a simple mirrored option, as opposed to the mandatory 3= server/quorum requirement.=20 I'm investigating a potential third site I can get access to, but I'm just = trying to fully explore any other potential options. Thanks for the quick response Ben -----Original Message----- From: Benjamin Reed [mailto:breed@apache.org]=20 Sent: Friday, February 03, 2012 4:05 PM To: user@zookeeper.apache.org Subject: Re: Deployment planning question what do you want to happen if both datacenters are up, but there is a parti= tion so that they cannot communicate with each other? answering that questi= on may get you closer to an answer. i think your two main options are to: 1) designate a data center that you require to be up for things to work. th= en you can put two in that one and one in the other 2) otherwise you need to setup a machine in another datacenter or amazon in= stance or something that will in effect decide which data center can run if= one of the datacenters goes down or you are in the partition scenario abov= e. ben On Fri, Feb 3, 2012 at 1:01 PM, Jason Harmon wrote: > We are planning a ZooKeeper deployment, but are struggling with some aspe= cts of our current architecture. We have two data centers; most application= s use load balancing between the two data centers to ensure redundancy for = disaster recovery etc. ZooKeeper has been challenging for us, because we ca= n't quite figure out how to structure our servers. We aren't starting with = lots of clients, so 3 servers should be a good fit initially. However, if w= e put two servers in one datacenter, and one server in the other, we're set= up for a quorum failure if we have a power outage at the datacenter with tw= o boxes. > Short of setting up a third data center for our division, what options do= we have for deploying ZooKeeper effectively, with disaster recovery in min= d etc. > Thanks! > > Jason Harmon > Senior Software Architect >