Return-Path: X-Original-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1830A78F9 for ; Fri, 5 Aug 2011 10:16:05 +0000 (UTC) Received: (qmail 27388 invoked by uid 500); 5 Aug 2011 10:16:04 -0000 Delivered-To: apmail-hadoop-common-issues-archive@hadoop.apache.org Received: (qmail 27007 invoked by uid 500); 5 Aug 2011 10:15:56 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-issues@hadoop.apache.org Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 26974 invoked by uid 99); 5 Aug 2011 10:15:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Aug 2011 10:15:52 +0000 X-ASF-Spam-Status: No, hits=-2000.7 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 05 Aug 2011 10:15:47 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 263748F8F1 for ; Fri, 5 Aug 2011 10:15:27 +0000 (UTC) Date: Fri, 5 Aug 2011 10:15:27 +0000 (UTC) From: "Justin Joseph (JIRA)" To: common-issues@hadoop.apache.org Message-ID: <597395128.11111.1312539327153.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <69773583.10351.1310107516869.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HADOOP-7455) Introduce HA Service Protocol Interface MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HADOOP-7455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079886#comment-13079886 ] Justin Joseph commented on HADOOP-7455: --------------------------------------- Neutral state is the protocol which ensures that only one Active Namenode will be present at a time. The approach says 'Avoid the possibility of having two Active Namenodes; rather than do fencing later to prevent the inconsistencies that can be caused by two Active Namenodes'. This is the key to understand the intent behind neutral state. The protocol demands that every decision to transition to a role or continue in the current role should be based on the connection maintained to the distributed coordinator, Zookeeper, with quorum size of 3 or more. The election of Active Namenode happens via the Leader Election recipe of Zookeeper. After the election, status quo is maintained till the Active Namenode goes down or the Namenode is unable to access the Zookeeper quorum. When Active Namenode is down, Standby will come to know about it through the watcher set on the Zookeeper & transitions to become Active. Once the Namenode (active / standby) loses the connection to Zookeeper, it will relinquish it's current role & stays idle by transitioning to neutral state. Once the Zookeeper service is available again to the Namenode, it conducts the active instance election once again. Based on the result of the election it decides the new role. This approach can ensure that there are no multiple Active Namenodes, even in the following scenarios a) Active Namenode gets network partitioned and Standby Namenode transitions as Active. b) Both Active & Standby Namenode are network partitioned c) Zookeeper ensemble is network partitioned > Introduce HA Service Protocol Interface > --------------------------------------- > > Key: HADOOP-7455 > URL: https://issues.apache.org/jira/browse/HADOOP-7455 > Project: Hadoop Common > Issue Type: Sub-task > Components: util > Reporter: Suresh Srinivas > Assignee: Suresh Srinivas > Fix For: HA Branch (HDFS-1623) > > Attachments: HADOOP-7455.1.patch, HADOOP-7455.2.patch, HADOOP-7455.3.patch, HADOOP-7455.4.patch, HADOOP-7455.5.patch, HDFS-7454.patch > > > This jira introduces a protocol interface to be implemented by services that provide HA functionality. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira