hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ming Ma (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-7281) Missing block is marked as corrupted block
Date Fri, 24 Oct 2014 03:58:33 GMT

     [ https://issues.apache.org/jira/browse/HDFS-7281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Ming Ma updated HDFS-7281:
    Attachment: HDFS-7281.patch

Thanks, Yongjun. Besides missing block is marked as corrupted block, corrupted block is marked
as missing block; that is because corrupted block has no live replica. So it will be useful
to clarify that is the intention as well.

So here is the patch that separates missing block from corrupted block. Will appreciate others'
opinions on the definition of missing and corrupted block.

1. A block is missing if and only if all DNs of its expected replicas are dead. 
2. A block is corrupted if and only if all its available replicas are corrupted. So if a block
has 3 replicas; one of the DN is dead, the other two replicas are corrupted; it will be marked
as corrupted.

> Missing block is marked as corrupted block
> ------------------------------------------
>                 Key: HDFS-7281
>                 URL: https://issues.apache.org/jira/browse/HDFS-7281
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Ming Ma
>         Attachments: HDFS-7281.patch
> In the situation where the block lost all its replicas, fsck shows the block is missing
as well as corrupted. Perhaps it is better not to mark the block corrupted in this case. The
reason it is marked as corrupted is numCorruptNodes == numNodes == 0 in the following code.
> {noformat}
> BlockManager
>     final boolean isCorrupt = numCorruptNodes == numNodes;
> {noformat}
> Would like to clarify if it is the intent to mark missing block as corrupted or it is
just a bug.

This message was sent by Atlassian JIRA

View raw message