hadoop-hdfs-issues mailing list archives

From "Lohit Vijayarenu (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HDFS-854) Datanode should scan devices in parallel to generate block report
Date Thu, 21 Jan 2010 21:59:54 GMT

    [ https://issues.apache.org/jira/browse/HDFS-854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803501#action_12803501 ]

Lohit Vijayarenu commented on HDFS-854:
---------------------------------------

Or perhaps keep the most recently added/deleted blocks in memory, send the block report
from that, and do a disk scan only once in a while?
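
One possible reading of this suggestion, as a rough sketch only: the datanode keeps an in-memory set of block IDs, updates it on every add/delete, builds block reports from that set, and only occasionally reconciles it against a full disk scan. The class and method names below (IncrementalBlockTracker, blockAdded, blockDeleted, buildReport, reconcile) are hypothetical and are not taken from the HDFS code base.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical sketch of an in-memory block view; not actual DataNode code. */
public class IncrementalBlockTracker {
    // Block IDs the datanode currently believes it stores.
    private final Set<Long> knownBlocks = new HashSet<>();

    /** Record a newly written block. */
    public synchronized void blockAdded(long blockId) {
        knownBlocks.add(blockId);
    }

    /** Record a deleted block. */
    public synchronized void blockDeleted(long blockId) {
        knownBlocks.remove(blockId);
    }

    /** Build a block report from memory, without touching the disks. */
    public synchronized Set<Long> buildReport() {
        return new HashSet<>(knownBlocks);
    }

    /**
     * Replace the in-memory view with the result of an occasional full disk
     * scan. Returns how many block IDs differed between the two views.
     */
    public synchronized int reconcile(Set<Long> scannedBlocks) {
        int drift = 0;
        for (Long id : scannedBlocks) {
            if (!knownBlocks.contains(id)) drift++; // on disk, missing in memory
        }
        for (Long id : knownBlocks) {
            if (!scannedBlocks.contains(id)) drift++; // in memory, missing on disk
        }
        knownBlocks.clear();
        knownBlocks.addAll(scannedBlocks);
        return drift;
    }
}
```

This only helps while the process is running. A freshly started datanode has no in-memory view yet, so the first block report would still need a disk scan or some persisted state.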

> Datanode should scan devices in parallel to generate block report
> -----------------------------------------------------------------
>
>                 Key: HDFS-854
>                 URL: https://issues.apache.org/jira/browse/HDFS-854
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: data-node
>            Reporter: dhruba borthakur
>            Assignee: Dmytro Molkov
>
> A Datanode should scan its disk devices in parallel so that the time to generate a block
> report is reduced. This will reduce the startup time of a cluster.
> A datanode has 12 disks (each of 1 TB) to store HDFS blocks. There is a total of 150K
> blocks on these 12 disks. It takes the datanode up to 20 minutes to scan these devices to
> generate the first block report.
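
The idea in the issue description can be sketched roughly as follows: one worker thread per disk walks that disk's directory tree, and the per-disk results are merged into a single report. This is an illustration under stated assumptions, not the patch attached to HDFS-854. The class ParallelVolumeScan and its methods are hypothetical, and it assumes block files are recognized by a "blk_" name prefix with ".meta" files skipped.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch of a per-disk parallel scan; not actual DataNode code. */
public class ParallelVolumeScan {

    /** Recursively collect block file names under one directory tree. */
    static void collectBlocks(File dir, List<String> out) {
        File[] entries = dir.listFiles();
        if (entries == null) {
            return; // not a directory, or not readable
        }
        for (File f : entries) {
            String name = f.getName();
            if (f.isDirectory()) {
                collectBlocks(f, out);
            } else if (name.startsWith("blk_") && !name.endsWith(".meta")) {
                out.add(name); // assumed naming convention for block files
            }
        }
    }

    /** Scan every volume root on its own thread and merge the results. */
    static List<String> scanVolumes(List<File> volumeRoots)
            throws InterruptedException, ExecutionException {
        List<String> report = new ArrayList<>();
        if (volumeRoots.isEmpty()) {
            return report;
        }
        // One thread per disk, so a slow device does not serialize the others.
        ExecutorService pool = Executors.newFixedThreadPool(volumeRoots.size());
        try {
            List<Future<List<String>>> futures = new ArrayList<>();
            for (final File root : volumeRoots) {
                Callable<List<String>> task = () -> {
                    List<String> found = new ArrayList<>();
                    collectBlocks(root, found);
                    return found;
                };
                futures.add(pool.submit(task));
            }
            for (Future<List<String>> f : futures) {
                report.addAll(f.get()); // blocks until that disk's scan is done
            }
        } finally {
            pool.shutdown();
        }
        return report;
    }
}
```

With 12 disks, the wall-clock time of such a scan is bounded by the slowest single disk rather than the sum over all disks, provided the disks do not share an I/O bottleneck.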

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

