hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-4591) HA clients can fail to fail over while Standby NN is performing long checkpoint
Date Tue, 12 Mar 2013 19:49:13 GMT

    [ https://issues.apache.org/jira/browse/HDFS-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13600392#comment-13600392
] 

Aaron T. Myers commented on HDFS-4591:
--------------------------------------

The TestDataDirs test failure is already known and is tracked by HDFS-4586.
                
> HA clients can fail to fail over while Standby NN is performing long checkpoint
> -------------------------------------------------------------------------------
>
>                 Key: HDFS-4591
>                 URL: https://issues.apache.org/jira/browse/HDFS-4591
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, namenode
>    Affects Versions: 2.0.4-alpha
>            Reporter: Aaron T. Myers
>            Assignee: Aaron T. Myers
>         Attachments: HDFS-4591.patch, HDFS-4591.patch
>
>
> Clients know to fail over to talk to the Active NN when they perform an RPC to the Standby
NN and it throws a StandbyException. However, most places in the code that check if the NN
is in the standby state do so inside the FSNS fsLock. Since this lock is held for the duration
of the saveNamespace during a checkpoint, StandbyExceptions will not be thrown during this
time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message