hadoop-common-issues mailing list archives

From "Todd Lipcon (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HADOOP-8220) ZKFailoverController doesn't handle failure to become active correctly
Date Tue, 27 Mar 2012 04:30:03 GMT
ZKFailoverController doesn't handle failure to become active correctly

                 Key: HADOOP-8220
                 URL: https://issues.apache.org/jira/browse/HADOOP-8220
             Project: Hadoop Common
          Issue Type: Bug
          Components: ha
    Affects Versions: 0.23.3, 0.24.0
            Reporter: Todd Lipcon
            Assignee: Todd Lipcon
            Priority: Critical

The ZKFC doesn't properly handle the case where the monitored service fails to become active.
Currently, it catches the exception and logs a warning, then calls quitElection() but keeps
processing. This causes an NPE when it later tries to use the same zkClient instance while
handling that same request. There is a test case for this failure, but it doesn't verify that
the node that had the failure is later able to recover properly.
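
The failure pattern described above can be illustrated with a minimal, self-contained sketch. It is not the ZKFC code: FailoverSketch, FakeZkClient, Elector, Service, buggyHandle, guardedHandle and rejoinElection are all hypothetical names, and the sketch assumes that quitElection() drops the client reference, which the report implies but does not state. The early return is one possible guard, not necessarily the fix adopted for this issue.

```java
import java.util.ArrayList;
import java.util.List;

public class FailoverSketch {
    /** Hypothetical stand-in for a ZooKeeper client handle. */
    static class FakeZkClient {
        final List<String> ops = new ArrayList<>();
        void createLockNode() { ops.add("create"); }
    }

    /** Hypothetical elector. Assumption: quitting the election drops the client. */
    static class Elector {
        FakeZkClient zkClient = new FakeZkClient();
        void quitElection() { zkClient = null; }
        void rejoinElection() { zkClient = new FakeZkClient(); }
    }

    /** Hypothetical monitored service that may fail to become active. */
    interface Service {
        void becomeActive() throws Exception;
    }

    /** Buggy shape: quit the election on failure, then fall through. */
    static String buggyHandle(Elector elector, Service service) {
        try {
            service.becomeActive();
        } catch (Exception e) {
            elector.quitElection(); // the real code also logs a warning here
        }
        try {
            elector.zkClient.createLockNode(); // zkClient is null after a failure
            return "ok";
        } catch (NullPointerException npe) {
            return "npe"; // caught only so the sketch can report the outcome
        }
    }

    /** Guarded shape: stop handling the request once the election was quit. */
    static String guardedHandle(Elector elector, Service service) {
        try {
            service.becomeActive();
        } catch (Exception e) {
            elector.quitElection();
            return "quit";
        }
        elector.zkClient.createLockNode();
        return "ok";
    }

    public static void main(String[] args) {
        Service failing = () -> { throw new Exception("cannot become active"); };
        Service healthy = () -> { };

        System.out.println(buggyHandle(new Elector(), failing));

        // Recovery check of the kind the report says the existing test lacks:
        // the same node fails once, rejoins, and must then succeed.
        Elector elector = new Elector();
        System.out.println(guardedHandle(elector, failing));
        elector.rejoinElection();
        System.out.println(guardedHandle(elector, healthy));
    }
}
```

The last three lines of main exercise the same elector after a failure, which is the recovery path the report says the existing test case does not cover.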


