hadoop-common-dev mailing list archives

From "Cagdas Gerede (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3022) Fast Cluster Restart
Date Mon, 05 May 2008 23:11:56 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12594390#action_12594390 ]

Cagdas Gerede commented on HADOOP-3022:

What is the replica count in this benchmark?
I am guessing it is 3: 6 million / 500 nodes = 12,000 objects per node, and 36,000 / 12,000 = 3.
Could you clarify?
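
The guess above can be reproduced with a few lines of arithmetic. This is only a sketch of the reasoning: the function name is hypothetical, and it assumes that 36,000 is the number of block replicas held by each datanode in the benchmark and that the 6 million objects are spread evenly across the 500 nodes.

```python
def inferred_replication(total_objects, datanodes, replicas_per_node):
    """Estimate the replication factor from benchmark totals.

    Assumes objects are spread evenly over the datanodes and that
    replicas_per_node counts every replica stored on one datanode.
    """
    unique_per_node = total_objects / datanodes   # 6,000,000 / 500 = 12,000
    return replicas_per_node / unique_per_node    # 36,000 / 12,000


print(inferred_replication(6_000_000, 500, 36_000))  # prints 3.0
```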

What does "process block reports" include? Does it include the time to generate the block
reports on the datanodes and the time for the namenode to receive them? Or is it only
the time to process all block reports, excluding the receiving time?

I was wondering how the numbers would be affected if you had the same number of objects but
1000 datanodes, or 250 datanodes, instead of 500.
Do you have an estimate?
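
For reference, here is how the size of each block report would change in those scenarios. It is a rough sketch, not a prediction of restart time: the function name is hypothetical, and it assumes 6 million objects, a replication factor of 3 (my guess above, not a confirmed figure), and an even spread of replicas across datanodes.

```python
def replicas_per_datanode(total_objects, replication, datanodes):
    """Replicas each datanode would list in its block report,
    assuming replicas are spread evenly across the cluster."""
    return total_objects * replication // datanodes


for datanodes in (250, 500, 1000):
    print(datanodes, replicas_per_datanode(6_000_000, 3, datanodes))
# prints:
# 250 72000
# 500 36000
# 1000 18000
```

The total number of replicas the namenode has to process stays at 18 million in all three cases; only the number of reports and the size of each report change.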

> Fast Cluster Restart
> --------------------
>                 Key: HADOOP-3022
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3022
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: dfs
>            Reporter: Robert Chansler
>            Assignee: Konstantin Shvachko
>             Fix For: 0.18.0
> This item introduces a discussion of how to reduce the time necessary to start a large
> cluster from tens of minutes to a handful of minutes.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
