hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-8279) Auto-HA: Allow manual failover to be invoked from zkfc.
Date Fri, 20 Apr 2012 06:05:38 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-8279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Todd Lipcon updated HADOOP-8279:
--------------------------------

    Attachment: hadoop-8279.txt

New rev fixes a number of issues:
- when ZKFC makes RPCs to the NN, it has to switch to its own UGI, so it has the right krb
credentials
- add a service ACL for ZKProtocol
- fix a potential deadlock with lock inversion in waitForActiveAttempt

I did some testing by running a pseudo-distributed secure cluster, and doing a failover back
and forth in a while loop from another shell. Ran a couple hundred of these successfully.
Also tested behavior when one of the NNs was down, etc.

I also ran one of the zkfcs under jcarder to look for potential deadlocks (only found the
one mentioned above)

Will continue testing this tomorrow, and also do some code cleanup. Mingjie, if you have a
chance to test this, would love to hear how it works for you.
                
> Auto-HA: Allow manual failover to be invoked from zkfc.
> -------------------------------------------------------
>
>                 Key: HADOOP-8279
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8279
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: ha
>    Affects Versions: Auto Failover (HDFS-3042)
>            Reporter: Mingjie Lai
>            Assignee: Todd Lipcon
>             Fix For: Auto Failover (HDFS-3042)
>
>         Attachments: hadoop-8279.txt, hadoop-8279.txt
>
>
> HADOOP-8247 introduces a configure flag to prevent potential status inconsistency between
zkfc and namenode, by making auto and manual failover mutually exclusive.
> However, as described in 2.7.2 section of design doc at HDFS-2185, we should allow manual
and auto failover co-exist, by:
> - adding some rpc interfaces at zkfc
> - manual failover shall be triggered by haadmin, and handled by zkfc if auto failover
is enabled. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message