hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-2795) HA: Standby NN takes a long time to recover from a dead DN starting up
Date Mon, 16 Jan 2012 07:47:53 GMT
HA: Standby NN takes a long time to recover from a dead DN starting up
----------------------------------------------------------------------

                 Key: HDFS-2795
                 URL: https://issues.apache.org/jira/browse/HDFS-2795
             Project: Hadoop HDFS
          Issue Type: Sub-task
          Components: data-node, ha, name-node
    Affects Versions: HA branch (HDFS-1623)
            Reporter: Aaron T. Myers
            Assignee: Aaron T. Myers
            Priority: Critical


To reproduce:

# Start an HA cluster with a DN.
# Write several blocks to the FS with replication 1.
# Shutdown the DN
# Wait for the NNs to declare the DN dead. All blocks will be under-replicated.
# Restart the DN.

Note that upon restarting the DN, the active NN will immediately get all block locations from
the initial BR. The standby NN will not, and instead will slowly add block locations for a
subset of the previously-missing blocks on every DN heartbeat.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message