Return-Path: X-Original-To: apmail-zookeeper-user-archive@www.apache.org Delivered-To: apmail-zookeeper-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3B6C718301 for ; Wed, 15 Jul 2015 18:06:49 +0000 (UTC) Received: (qmail 39884 invoked by uid 500); 15 Jul 2015 18:06:39 -0000 Delivered-To: apmail-zookeeper-user-archive@zookeeper.apache.org Received: (qmail 39834 invoked by uid 500); 15 Jul 2015 18:06:39 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Delivered-To: moderator for user@zookeeper.apache.org Received: (qmail 11837 invoked by uid 500); 15 Jul 2015 17:54:29 -0000 Delivered-To: apmail-hadoop-zookeeper-user@hadoop.apache.org X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.487 X-Spam-Level: *** X-Spam-Status: No, score=3.487 tagged_above=-999 required=6.31 tests=[DKIM_ADSP_CUSTOM_MED=0.001, NML_ADSP_CUSTOM_MED=1.2, SPF_SOFTFAIL=0.972, URIBL_BLOCKED=0.001, URI_HEX=1.313] autolearn=disabled Date: Wed, 15 Jul 2015 10:54:21 -0700 (MST) From: Vikas Mehta To: zookeeper-user@hadoop.apache.org Message-ID: <1436982861611-7581277.post@n2.nabble.com> Subject: locking/leader election and dealing with session loss MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit When using zookeeper for leader election or distributed locking, my assumption is that as soon as the lock owner sees session transition to 'CONNECTING' state, it should commit suicide to avoid the risk of multiple owners. Please correct my assumption if I am wrong or there is a better way to guarantee a single lock owner/leader. If above assumption is correct, I am trying to figure out how I can improve the availability of the application (leader/lock owner) when zookeeper ensemble is broken (eg. undergoing zookeeper leader election for prolonged period of time). Options I have considered: 1/ use multiple ensembles for leader election/locking to avoid SPOF (complex to implement) 2/ extend the zookeeper protocol to provide client more info on connection loss, like zookeeper leader election in progress so that client can decide when it is ok to not commit suicide and still guarantee a single application leader/lock owner. (haven't been able to prove that this will guarantee single application leader/lock owner). If this has been already answered or solved, please point me to the post/doc. -- View this message in context: http://zookeeper-user.578899.n2.nabble.com/locking-leader-election-and-dealing-with-session-loss-tp7581277.html Sent from the zookeeper-user mailing list archive at Nabble.com.