Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: hdfs-issues@hadoop.apache.org
Message-ID: <1020535813.566151269908967780.JavaMail.jira@brutus.apache.org>
Date: Tue, 30 Mar 2010 00:29:27 +0000 (UTC)
From: "Dmytro Molkov (JIRA)" <jira@apache.org>
To: hdfs-issues@hadoop.apache.org
Subject: [jira] Commented: (HDFS-854) Datanode should scan devices in
 parallel to generate block report
In-Reply-To: <1826844919.1261898129464.JavaMail.jira@brutus.apache.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/HDFS-854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851177#action_12851177 ] 

Dmytro Molkov commented on HDFS-854:
------------------------------------

The errors in the build system again look like they are not related to the actual patch. Two errors in BlockScanner are

java.lang.RuntimeException: java.util.zip.ZipException: error reading zip file
	at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1715)
	at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1529)
	at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1475)
	at org.apache.hadoop.conf.Configuration.set(Configuration.java:596)
	at org.apache.hadoop.conf.Configuration.setLong(Configuration.java:731)
	at org.apache.hadoop.hdfs.TestDatanodeBlockScanner.blockCorruptionRecoveryPolicy(TestDatanodeBlockScanner.java:277)
	at org.apache.hadoop.hdfs.TestDatanodeBlockScanner.testBlockCorruptionRecoveryPolicy(TestDatanodeBlockScanner.java:269)

and there is one more error with HDFSProxy which cannot have anything to do with my code changes.

> Datanode should scan devices in parallel to generate block report
> -----------------------------------------------------------------
>
>                 Key: HDFS-854
>                 URL: https://issues.apache.org/jira/browse/HDFS-854
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: data-node
>    Affects Versions: 0.22.0
>            Reporter: dhruba borthakur
>            Assignee: Dmytro Molkov
>             Fix For: 0.22.0
>
>         Attachments: HDFS-854-2.patch, HDFS-854.patch, HDFS-854.patch.1
>
>
> A Datanode should scan its disk devices in parallel so that the time to generate a block report is reduced. This will reduce the startup time of a cluster.
> A datanode has 12 disk (each of 1 TB) to store HDFS blocks. There is a total of 150K blocks on these 12 disks. It takes the datanode upto 20 minutes to scan these devices to generate the first block report.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.