Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 48966 invoked from network); 2 Jul 2008 12:50:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 2 Jul 2008 12:50:02 -0000 Received: (qmail 89924 invoked by uid 500); 2 Jul 2008 12:50:01 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 89602 invoked by uid 500); 2 Jul 2008 12:50:00 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 89591 invoked by uid 99); 2 Jul 2008 12:50:00 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Jul 2008 05:50:00 -0700 X-ASF-Spam-Status: No, hits=-2.8 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [192.6.10.2] (HELO colossus.hpl.hp.com) (192.6.10.2) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Jul 2008 12:49:08 +0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by colossus.hpl.hp.com (Postfix) with ESMTP id 916C16BAA0 for ; Wed, 2 Jul 2008 13:49:27 +0100 (BST) X-Virus-Scanned: amavisd-new at hplb.hpl.hp.com Received: from colossus.hpl.hp.com ([127.0.0.1]) by localhost (colossus.hpl.hp.com [127.0.0.1]) (amavisd-new, port 10024) with LMTP id BiVDcAXN3Wgz for ; Wed, 2 Jul 2008 13:49:26 +0100 (BST) Received: from ha-node-br2.hpl.hp.com (ha-node-br2.hpl.hp.com [16.25.144.57]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by colossus.hpl.hp.com (Postfix) with ESMTP id 7E97B6BA28 for ; Wed, 2 Jul 2008 13:49:26 +0100 (BST) MailScanner-NULL-Check: 1215607743.84345@pCL90SZ6VKTqZAGjQsf+kQ Received: from [16.25.171.118] (morzine.hpl.hp.com [16.25.171.118]) by ha-node-br2.hpl.hp.com (8.14.1/8.13.4) with ESMTP id m62Cn1x7020588 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Wed, 2 Jul 2008 13:49:02 +0100 (BST) Message-ID: <486B7949.8010103@apache.org> Date: Wed, 02 Jul 2008 13:49:13 +0100 From: Steve Loughran User-Agent: Thunderbird 2.0.0.14 (X11/20080421) MIME-Version: 1.0 To: core-dev@hadoop.apache.org Subject: Re: porting HDFS to Zookeeper?? References: <811fc3830806301336r393d3b61k3f9e43b5ef7909fd@mail.gmail.com> <48696E79.5030009@yahoo-inc.com> <811fc3830807010141t76246fd0m9205c7dfe87c7069@mail.gmail.com> In-Reply-To: <811fc3830807010141t76246fd0m9205c7dfe87c7069@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-HPL-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: m62Cn1x7020588 X-HPL-MailScanner: Found to be clean X-HPL-MailScanner-From: stevel@apache.org X-Virus-Checked: Checked by ClamAV on apache.org Sangmin Lee wrote: > Thank you for the replies. > I meant porting HDFS to Zookeeper to make namenode fault-tolerant and highly > available rather than the simple usage of Zookeeper. Basically by storing > namespace metadata in Zookeeper, we may be able to get highly available > replicated namenodes. So I meant using Zookeeper as frontend of HDFS. Anyway > it seems to me that there has been no actual work going on regarding this, > isn't it? :-) > > -sangmin I think you've just volunteered. @work we use our own tuple space thing, Anubis: http://smartfrog.svn.sourceforge.net/viewvc/smartfrog/trunk/core/components/anubis/doc/ it lets you do the election/voting stuff to see who is in charge in an HA cluster, though to deal with a partition event robustly is hard; you need somebody that really understands the maths behind HA distributed computing. I am not in that category, so we are lucky there are others nearby who do, but it implies there is a block of code I am very reluctant to approach. In fact, its a bit like security. just like it is easy to do an insecure system, but very hard to do a secure one, its easy to be low-availability, really, really hard to be HA. -steve