hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-4591) HA clients can fail to fail over while Standby NN is performing long checkpoint
Date Wed, 13 Mar 2013 19:58:13 GMT

     [ https://issues.apache.org/jira/browse/HDFS-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Aaron T. Myers updated HDFS-4591:
---------------------------------

       Resolution: Fixed
    Fix Version/s: 2.0.5-beta
     Hadoop Flags: Reviewed
           Status: Resolved  (was: Patch Available)

Thanks a lot for the reviews, Todd. I've just committed this to trunk and branch-2.
                
> HA clients can fail to fail over while Standby NN is performing long checkpoint
> -------------------------------------------------------------------------------
>
>                 Key: HDFS-4591
>                 URL: https://issues.apache.org/jira/browse/HDFS-4591
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, namenode
>    Affects Versions: 2.0.4-alpha
>            Reporter: Aaron T. Myers
>            Assignee: Aaron T. Myers
>             Fix For: 2.0.5-beta
>
>         Attachments: HDFS-4591.patch, HDFS-4591.patch, HDFS-4591.patch
>
>
> Clients know to fail over to talk to the Active NN when they perform an RPC to the Standby
NN and it throws a StandbyException. However, most places in the code that check if the NN
is in the standby state do so inside the FSNS fsLock. Since this lock is held for the duration
of the saveNamespace during a checkpoint, StandbyExceptions will not be thrown during this
time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message