hadoop-hdfs-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-11860) Ozone: SCM: SCMContainerPlacementCapacity#chooseNode sometimes does not remove chosen node from healthy list.
Date Thu, 26 Apr 2018 21:51:08 GMT

    [ https://issues.apache.org/jira/browse/HDFS-11860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16455074#comment-16455074 ]

Hudson commented on HDFS-11860:
-------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #14070 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/14070/])
HDFS-11860. Ozone: SCM: SCMContainerPlacementCapacity#chooseNode (omalley: rev 56966a0535b6cfbd4f3a90124ee7282610b77a87)
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/ozone/scm/container/placement/algorithms/SCMContainerPlacementCapacity.java


> Ozone: SCM: SCMContainerPlacementCapacity#chooseNode sometimes does not remove chosen node from healthy list.
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-11860
>                 URL: https://issues.apache.org/jira/browse/HDFS-11860
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ozone
>    Affects Versions: HDFS-7240
>            Reporter: Xiaoyu Yao
>            Assignee: Xiaoyu Yao
>            Priority: Major
>             Fix For: HDFS-7240
>
>         Attachments: HDFS-11860-HDFS-7240.001.patch, HDFS-11860-HDFS-7240.002.patch
>
>
> This was caught in a random Jenkins run. After debugging, the cause turned out to be the
> logic below for the case where the two random indices happen to be the same: the node is
> returned without being removed from the healthy list for the next round of selection. As a
> result, duplicate datanodes could be chosen for the pipeline, and the machine list size
> could be smaller than expected. I will post a fix soon.
> {code}
> // SCMContainerPlacementCapacity#chooseNode
> // There is a possibility that both numbers will be same.
> // if that is so, we just return the node.
> if (firstNodeNdx == secondNodeNdx) {
>   return healthyNodes.get(firstNodeNdx);
> }
> {code}
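
For readers following along, here is a minimal sketch of how the equal-index case could be handled so that the chosen node always leaves the healthy list. It is not the committed patch: the `Node` class, the `chooseNodes` helper, and the "more remaining space wins" comparison are assumptions made for illustration, and only `firstNodeNdx`, `secondNodeNdx`, and `healthyNodes` come from the snippet above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

final class ChooseNodeSketch {

  /** Hypothetical stand-in for a datanode; not the real Ozone type. */
  static final class Node {
    final String id;
    final long remaining;

    Node(String id, long remaining) {
      this.id = id;
      this.remaining = remaining;
    }
  }

  /**
   * Picks two random indices and keeps the candidate with more remaining
   * space (an assumed comparison). The pick is removed from healthyNodes
   * on every path, including when both indices are equal.
   */
  static Node chooseNode(List<Node> healthyNodes, Random random) {
    int firstNodeNdx = random.nextInt(healthyNodes.size());
    int secondNodeNdx = random.nextInt(healthyNodes.size());

    Node chosen;
    if (firstNodeNdx == secondNodeNdx) {
      // Same index twice: there is only one candidate this round.
      chosen = healthyNodes.get(firstNodeNdx);
    } else {
      Node first = healthyNodes.get(firstNodeNdx);
      Node second = healthyNodes.get(secondNodeNdx);
      chosen = first.remaining >= second.remaining ? first : second;
    }
    // Removing here, rather than only in the else branch, is what keeps
    // the same node from being picked again in the next round.
    healthyNodes.remove(chosen);
    return chosen;
  }

  /**
   * Hypothetical driver: chooses count distinct nodes from a copy of the
   * healthy list. Assumes count <= healthy.size().
   */
  static List<Node> chooseNodes(List<Node> healthy, int count, Random random) {
    List<Node> candidates = new ArrayList<>(healthy);
    List<Node> result = new ArrayList<>();
    for (int i = 0; i < count; i++) {
      result.add(chooseNode(candidates, random));
    }
    return result;
  }
}
```

With the removal on the shared path, asking for three nodes out of five healthy ones should yield three distinct nodes regardless of which indices the random generator produces.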



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
