From: Martin Waite
To: zookeeper-user@hadoop.apache.org
Subject: Re: Managing multi-site clusters with Zookeeper
Date: Sat, 6 Mar 2010 20:01:19 +0000
Message-ID: <8bc75ecf1003061201rf8c9e5m36e9e684f798f677@mail.gmail.com>
References: <8bc75ecf1003060150n396f5590g8a3cb4f341d28167@mail.gmail.com>

I take your point about reliability, but I have no option other than
finding a multi-site solution.

Unfortunately, in my experience sites are much less reliable than
individual machines, and so in a way coping with site failure is more
important than individual machine failure. I imagine that the risk
profile changes according to the number of machines you have, however.

Thanks for the input

Martin

On 6 March 2010 18:29, Ted Dunning wrote:

> What you describe is relatively reasonable, even though Zookeeper is not
> normally distributed across multiple data centers with all members getting
> full votes. If you account for the limited throughput that this will impose
> on your applications that use ZK, then I think that this can work well.
> Probably, you would have local ZK clusters for higher transaction rate
> applications.
>
> You should also consider very carefully whether having multiple data centers
> increases or decreases your overall reliability. Unless you design very
> carefully, this will normally substantially degrade reliability. Making
> sure that it increases reliability is a really big task that involves a lot
> of surprising (it was to me) considerations and considerable hardware and
> time investments.
>
> Good luck!
>
> On Sat, Mar 6, 2010 at 1:50 AM, Martin Waite wrote:
>
> > Is this a viable approach, or am I taking Zookeeper out of its application
> > domain and just asking for trouble ?
>
> --
> Ted Dunning, CTO
> DeepDyve