hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-2753) Standby namenode stuck in safemode during a failover
Date Tue, 10 Jan 2012 21:58:39 GMT

    [ https://issues.apache.org/jira/browse/HDFS-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183626#comment-13183626
] 

Todd Lipcon commented on HDFS-2753:
-----------------------------------

The test adds blocks while the SBN is down. This makes them get queued up in the block received
list of that BPServiceActor.
When it restarts, the DN calls register(), followed by reportReceivedDeletedBlocks(), followed
by blockReport(). So the received blocks always show up first.

If you comment out the fix, the test case reliably fails with the error you described (stuck
in safemode).
                
> Standby namenode stuck in safemode during a failover
> ----------------------------------------------------
>
>                 Key: HDFS-2753
>                 URL: https://issues.apache.org/jira/browse/HDFS-2753
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ha, name-node
>    Affects Versions: HA branch (HDFS-1623)
>            Reporter: Hari Mankude
>            Assignee: Hari Mankude
>         Attachments: HDFS-2753.patch, hdfs-2753.txt
>
>
> Write traffic initiated from the client. Manual failover is done by killing NN and converting
a  different standby to active. NN is restarted as standby. The restarted standby stays in
safemode forever. More information in the description.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message