hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jing Zhao (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-9866) BlockManager#chooseExcessReplicasStriped may weaken rack fault tolerance
Date Mon, 29 Feb 2016 18:49:18 GMT

    [ https://issues.apache.org/jira/browse/HDFS-9866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172359#comment-15172359

Jing Zhao commented on HDFS-9866:

Thanks for reporting the issue, [~tfukudom]! I think there may be some other issues there.
I will dig further into this.

> BlockManager#chooseExcessReplicasStriped may weaken rack fault tolerance
> ------------------------------------------------------------------------
>                 Key: HDFS-9866
>                 URL: https://issues.apache.org/jira/browse/HDFS-9866
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: namenode
>    Affects Versions: 3.0.0
>            Reporter: Takuya Fukudome
>            Assignee: Jing Zhao
>             Fix For: 3.0.0
>         Attachments: HDFS-9866.000.patch
> In [~tfukudom]'s system tests, we find the following issue:
> A striped block group B has redundant internal block replicas. 9 internal blocks are
stored in 10 datanodes across 6 racks. Datanode d1 and d2 both store a replica for internal
block b1. d1's rack contains multiple internal blocks while d2's rack only has b1. Then when
choosing a duplicated replica to delete,  the current implementation may wrongly choose d2
thus causes the total number of racks to be decreased to 5.

This message was sent by Atlassian JIRA

View raw message