hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hairong Kuang (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-5588) hadoop commands seem extremely slow in 0.20 branch
Date Mon, 30 Mar 2009 17:52:50 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12693864#action_12693864
] 

Hairong Kuang commented on HADOOP-5588:
---------------------------------------

Koji did some experiments with the patch. He is too busy to post the results. I am doing this
for him.

Directory size with 10,000 files.
About 450 mappers. Each mapper calling dfs -get 10000 times.

Without the fix, namenode was showing 20-30 getblocklocations per sec and 30-40 threads blocked.
With the fix, 600 getblocklocations per sec and almost no blocked threads. 

> hadoop commands seem extremely slow in 0.20 branch
> --------------------------------------------------
>
>                 Key: HADOOP-5588
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5588
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs, fs
>    Affects Versions: 0.20.0
>         Environment: 0.20-branch and trunk
>            Reporter: Koji Noguchi
>            Assignee: Hairong Kuang
>            Priority: Blocker
>             Fix For: 0.20.0, 0.21.0
>
>         Attachments: globStatus.patch, globStatus1.patch
>
>
> hadoop dfs get, rm, -mkdir- ,cp, mv, ls, etc   mydir/fileA mydir/fileB mydir/fileC ...
> seem to be very slow in 0.20 branch. 
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message