hadoop-hdfs-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-3087) Decomissioning on NN restart can complete without blocks being replicated
Date Tue, 25 Mar 2014 11:37:38 GMT

    [ https://issues.apache.org/jira/browse/HDFS-3087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13946400#comment-13946400 ]

Hudson commented on HDFS-3087:
------------------------------

FAILURE: Integrated in Hadoop-Yarn-trunk #520 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/520/])
HDFS-3087. Decomissioning on NN restart can complete without blocks being replicated. Contributed
by Rushabh S Shah. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1580886)
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeDescriptor.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/DatanodeManager.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDecommission.java


> Decomissioning on NN restart can complete without blocks being replicated
> -------------------------------------------------------------------------
>
>                 Key: HDFS-3087
>                 URL: https://issues.apache.org/jira/browse/HDFS-3087
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 0.23.0
>            Reporter: Kihwal Lee
>            Assignee: Rushabh S Shah
>            Priority: Critical
>             Fix For: 3.0.0, 2.5.0
>
>         Attachments: HDFS-3087.patch
>
>
> If a data node is added to the exclude list and the name node is restarted,
> decommissioning starts as soon as the data node registers. At that point the
> initial block report has not yet been sent, so the name node believes the node
> holds zero blocks and decommissioning completes almost immediately, without
> replicating the blocks stored on that node.
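
The race described above can be illustrated with a small, self-contained model. This is only a sketch under stated assumptions: the class, field, and method names (DecommissionSketch, NodeView, canFinishNaive, canFinishGuarded) are hypothetical and are not taken from the HDFS source or from HDFS-3087.patch. It contrasts a completion check that trusts an empty block list with one that also requires the first block report to have arrived.

```java
// Simplified model of the race described in the issue. All names here are
// hypothetical and are NOT taken from the HDFS source or the attached patch.
final class DecommissionSketch {

    /** Minimal stand-in for the name node's view of one data node. */
    static final class NodeView {
        boolean decommissionInProgress;
        boolean firstBlockReportReceived;
        int underReplicatedBlocks; // blocks the name node knows still need copies

        /** Registration after a name node restart: excluded nodes start decommissioning. */
        void register(boolean inExcludeList) {
            decommissionInProgress = inExcludeList;
            firstBlockReportReceived = false;
            underReplicatedBlocks = 0; // nothing is known until a block report arrives
        }

        /** The initial block report tells the name node what the node stores. */
        void receiveBlockReport(int blocksStillNeedingReplication) {
            firstBlockReportReceived = true;
            underReplicatedBlocks = blocksStillNeedingReplication;
        }
    }

    /** Naive check: an empty block list is taken to mean "fully replicated". */
    static boolean canFinishNaive(NodeView n) {
        return n.decommissionInProgress && n.underReplicatedBlocks == 0;
    }

    /** Guarded check: refuse to finish until a block report has been seen. */
    static boolean canFinishGuarded(NodeView n) {
        return n.decommissionInProgress
                && n.firstBlockReportReceived
                && n.underReplicatedBlocks == 0;
    }
}
```

In this model, a node that has just registered from the exclude list passes the naive check right away, while the guarded check holds it in the decommissioning state until a block report arrives and its blocks no longer need replication.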



--
This message was sent by Atlassian JIRA
(v6.2#6252)
