hadoop-hdfs-issues mailing list archives

From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-3026) HA: Handle failure during HA state transition
Date Thu, 10 May 2012 15:28:49 GMT

     [ https://issues.apache.org/jira/browse/HDFS-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Aaron T. Myers updated HDFS-3026:

    Attachment: HDFS-3026.patch

I looked into what it would take to make the RPC server support a semi-shutdown state, in
which it could return one final response to the client that initiated a shutdown via an RPC,
but cancel all other RPCs and refuse any further incoming connections. Doing so would require
a fair bit of surgery on the o.a.h.ipc.Server shutdown code. Since clients initiating HA state
transitions must already handle the case where an RPC to the NN times out, the extra
complexity doesn't seem worth it.

In the attached patch, I've removed the delayed shutdown code and instead shut down the NN
immediately upon failure to fully perform an HA state transition.
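
For illustration only, here is a minimal sketch of the fail-fast idea described above. It is
not the code in the attached patch. Every name in it (HaTransitionSketch, Node, Terminator,
TransitionStep) is hypothetical, and the Terminator hook stands in for whatever actually stops
the process; a real daemon would presumably exit the JVM there.

```java
import java.util.ArrayList;
import java.util.List;

final class HaTransitionSketch {
    /** Hypothetical hook that stops the process; injected so the sketch is testable. */
    interface Terminator {
        void terminate(int status, Throwable cause);
    }

    /** Hypothetical stand-in for one unit of work in a state transition. */
    interface TransitionStep {
        void run() throws Exception;
    }

    enum State { STANDBY, ACTIVE }

    static final class Node {
        private final Terminator terminator;
        private State state = State.STANDBY;

        Node(Terminator terminator) {
            this.terminator = terminator;
        }

        State getState() {
            return state;
        }

        /** Runs every step; on any failure, terminates at once instead of scheduling a delayed shutdown. */
        void transitionToActive(List<TransitionStep> steps) {
            try {
                for (TransitionStep step : steps) {
                    step.run();
                }
                // Only mark the node active once every step has succeeded.
                state = State.ACTIVE;
            } catch (Throwable t) {
                // The node may be partially transitioned, so no attempt is made to send
                // a final response; the caller is expected to handle an RPC timeout.
                terminator.terminate(1, t);
            }
        }
    }

    public static void main(String[] args) {
        // Assumption: this demo only logs; it does not actually exit the JVM.
        Terminator logOnly = (status, cause) ->
            System.out.println("terminating, status=" + status + ", cause=" + cause);
        Node node = new Node(logOnly);
        List<TransitionStep> steps = new ArrayList<>();
        steps.add(() -> { throw new IllegalStateException("simulated failure mid-transition"); });
        node.transitionToActive(steps);
        System.out.println("state after failed transition: " + node.getState());
    }
}
```

The trade-off is the one noted above: the initiating client gets no final response and sees a
timeout or dropped connection, which it already has to handle.
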
> HA: Handle failure during HA state transition
> ---------------------------------------------
>                 Key: HDFS-3026
>                 URL: https://issues.apache.org/jira/browse/HDFS-3026
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, name-node
>    Affects Versions: 2.0.0
>            Reporter: Aaron T. Myers
>            Assignee: Aaron T. Myers
>         Attachments: HDFS-3026-HDFS-1623.patch, HDFS-3026.patch
> This JIRA is to address a TODO in NameNode about handling the possibility of an incomplete
> HA state transition.
