Date: Wed, 3 Oct 2012 22:54:07 +1100 (NCT)
From: "Steve Loughran (JIRA)"
To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Commented] (HDFS-3998) Speed up fsck

[ https://issues.apache.org/jira/browse/HDFS-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13468504#comment-13468504 ]

Steve Loughran commented on HDFS-3998:
--------------------------------------

"Block with one replica is still bad": not so much bad as exceedingly at risk, which is why it's almost max priority in the replication queue.
Having some way to view that queue could be a way to see what's happening there.

> Speed up fsck
> -------------
>
> Key: HDFS-3998
> URL: https://issues.apache.org/jira/browse/HDFS-3998
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: name-node
> Reporter: Ming Ma
>
> We have some big clusters. Sometimes we want to quickly find the list of missing blocks or blocks with only one replica. Currently fsck has to take a path as input, and it then recursively checks for inconsistencies. That can take a long time to find the missing blocks and the files they belong to. It would be useful to speed this up. For example, it could go directly to the missing blocks stored in the NN and do the file lookup instead.
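
To make the idea in the issue description concrete, here is a minimal sketch that contrasts the two lookup directions on a toy in-memory model. It is not the Hadoop API: the dictionaries, the function names (scan_by_path, files_for_blocks), and the assumption that the NN already keeps a list of at-risk blocks plus a block-to-file index are all hypothetical, chosen only for illustration.

```python
from typing import Dict, Iterable, List

# Toy model, NOT the Hadoop API: every name below is hypothetical.
# namespace: file path -> block IDs; replicas: block ID -> live replica count.
namespace: Dict[str, List[str]] = {
    "/data/a": ["b1", "b2"],
    "/data/b": ["b3"],
    "/logs/c": ["b4"],
}
replicas: Dict[str, int] = {"b1": 3, "b2": 1, "b3": 3, "b4": 0}


def scan_by_path(root: str, max_replicas: int = 1) -> Dict[str, List[str]]:
    """fsck-style direction: visit every file under root and check each block."""
    prefix = root.rstrip("/") + "/"
    hits: Dict[str, List[str]] = {}
    for path, blocks in namespace.items():
        if path.startswith(prefix):
            bad = [b for b in blocks if replicas.get(b, 0) <= max_replicas]
            if bad:
                hits[path] = bad
    return hits


def files_for_blocks(at_risk: Iterable[str],
                     block_to_file: Dict[str, str]) -> Dict[str, List[str]]:
    """Proposed direction: start from the at-risk blocks and look up their files."""
    hits: Dict[str, List[str]] = {}
    for block in at_risk:
        hits.setdefault(block_to_file[block], []).append(block)
    return hits


# Assumed to be maintained elsewhere in this toy model: a reverse index and
# a list of blocks that are missing (0 replicas) or down to a single replica.
block_to_file = {b: path for path, blocks in namespace.items() for b in blocks}
at_risk_blocks = ["b2", "b4"]

by_path = scan_by_path("/")
by_block = files_for_blocks(at_risk_blocks, block_to_file)
```

Both calls report the same files, but the first touches every file under the given path while the second does work proportional only to the number of at-risk blocks, which is the speed-up the issue asks for.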