hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-3998) Speed up fsck
Date Wed, 03 Oct 2012 11:54:07 GMT

    [ https://issues.apache.org/jira/browse/HDFS-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13468504#comment-13468504

Steve Loughran commented on HDFS-3998:

"Block with one replica is still bad" -not so much bad as at exceedingly at risk, which is
why it's almost max priority in the replication queue. Having some way to view that queue
could be a way to see what's happening there
> Speed up fsck
> -------------
>                 Key: HDFS-3998
>                 URL: https://issues.apache.org/jira/browse/HDFS-3998
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: name-node
>            Reporter: Ming Ma
> We have some big clusters. Sometimes we want to find out the list of missing blocks or
blocks with only one replica quickly. Currently fsck has to take a path as input and it then
recursively check for inconsistency. That could take a long time to find the missing blocks
and the files the missing blocks belong to. It will be useful to speed this up. For example,
it could go directly to missing blocks stored in NN and do the file lookup instead.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message