Return-Path: Delivered-To: apmail-hadoop-zookeeper-user-archive@locus.apache.org Received: (qmail 10013 invoked from network); 16 Dec 2008 17:45:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Dec 2008 17:45:01 -0000 Received: (qmail 204 invoked by uid 500); 16 Dec 2008 17:45:14 -0000 Delivered-To: apmail-hadoop-zookeeper-user-archive@hadoop.apache.org Received: (qmail 170 invoked by uid 500); 16 Dec 2008 17:45:14 -0000 Mailing-List: contact zookeeper-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: zookeeper-user@hadoop.apache.org Delivered-To: mailing list zookeeper-user@hadoop.apache.org Received: (qmail 144 invoked by uid 99); 16 Dec 2008 17:45:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Dec 2008 09:45:13 -0800 X-ASF-Spam-Status: No, hits=-4.0 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [192.18.98.43] (HELO brmea-mail-2.sun.com) (192.18.98.43) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Dec 2008 17:44:52 +0000 Received: from fe-amer-10.sun.com ([192.18.109.80]) by brmea-mail-2.sun.com (8.13.6+Sun/8.12.9) with ESMTP id mBGHiVOX001007 for ; Tue, 16 Dec 2008 17:44:31 GMT Received: from conversion-daemon.mail-amer.sun.com by mail-amer.sun.com (Sun Java System Messaging Server 6.2-8.04 (built Feb 28 2007)) id <0KBZ00601CR2RT00@mail-amer.sun.com> (original mail from Thomas.Johnson@Sun.COM) for zookeeper-user@hadoop.apache.org; Tue, 16 Dec 2008 10:44:31 -0700 (MST) Received: from [129.148.70.228] by mail-amer.sun.com (Sun Java System Messaging Server 6.2-8.04 (built Feb 28 2007)) with ESMTPSA id <0KBZ003TYD9T3JH0@mail-amer.sun.com> for zookeeper-user@hadoop.apache.org; Tue, 16 Dec 2008 10:44:19 -0700 (MST) Date: Tue, 16 Dec 2008 12:45:38 -0500 From: Thomas Vinod Johnson Subject: What happens when a server loses all its state? Sender: Thomas.Johnson@Sun.COM To: zookeeper-user@hadoop.apache.org Message-id: <4947E942.3020209@sun.com> MIME-version: 1.0 Content-type: text/plain; format=flowed; charset=ISO-8859-1 Content-transfer-encoding: 7BIT User-Agent: Thunderbird 2.0.0.9 (X11/20080119) X-Virus-Checked: Checked by ClamAV on apache.org What is the expected behavior if a server in a ZooKeeper service restarts with all its prior state lost? Empirically, everything seems to work*. Is this something that one can count on, as part of ZooKeeper design, or are there known conditions under which this could cause problems, either liveness or violation of ZooKeeper guarantees? I'm really most interested in a situation where a single server loses state, but insights into issues when more than one server loses state and other interesting failure scenarios are appreciated. Thanks. * The restarted server appears to catch up to the latest snapshot (from the current leader?).