hadoop-common-dev mailing list archives

From "Hairong Kuang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-5588) hadoop commands seem extremely slow in 0.20 branch
Date Fri, 27 Mar 2009 00:31:50 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hairong Kuang updated HADOOP-5588:
----------------------------------

    Affects Version/s: 0.20.0
        Fix Version/s: 0.20.0
             Assignee: Hairong Kuang

A suspect is HADOOP-3497, which made globStatus issue a listing call on the parent directory
whether or not the path contains globs. One of our users calls "dfs -get" on many small
files under one dir. This has the same effect as calling "dfs -ls" many times on a large
directory, which causes the NN to do a lot of GC and makes it less responsive.
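
To make the suspected cost concrete, here is a minimal sketch. It is not the Hadoop code and
not the eventual fix: GlobSketch, its in-memory DIRS map, and the set of characters treated as
glob metacharacters are all assumptions made for illustration. It compares resolving a path by
always listing the parent directory with resolving it through a single existence check when
the path contains no glob characters.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Pattern;

public class GlobSketch {
    // Hypothetical stand-in for the namespace: directory -> file names.
    static final Map<String, List<String>> DIRS = new TreeMap<>();
    static int listCalls = 0; // expensive: cost grows with directory size
    static int statCalls = 0; // cheap: one lookup per path

    // Assumed set of glob metacharacters; true if any appears in the path.
    static boolean hasGlob(String path) {
        for (char c : path.toCharArray()) {
            if ("*?[]{}\\".indexOf(c) >= 0) return true;
        }
        return false;
    }

    static List<String> list(String dir) {
        listCalls++;
        return DIRS.getOrDefault(dir, new ArrayList<>());
    }

    static boolean exists(String dir, String name) {
        statCalls++;
        return DIRS.getOrDefault(dir, new ArrayList<>()).contains(name);
    }

    // Simplified translation: only '*' and '?' are treated as wildcards.
    static String globToRegex(String glob) {
        StringBuilder sb = new StringBuilder();
        for (char c : glob.toCharArray()) {
            if (c == '*') sb.append(".*");
            else if (c == '?') sb.append('.');
            else sb.append(Pattern.quote(String.valueOf(c)));
        }
        return sb.toString();
    }

    // Suspected behaviour: list the parent directory for every path.
    static List<String> resolveAlwaysList(String path) {
        int slash = path.lastIndexOf('/');
        String dir = path.substring(0, slash);
        Pattern p = Pattern.compile(globToRegex(path.substring(slash + 1)));
        List<String> out = new ArrayList<>();
        for (String f : list(dir)) {
            if (p.matcher(f).matches()) out.add(dir + "/" + f);
        }
        return out;
    }

    // Alternative: skip the listing when the path has no glob characters.
    static List<String> resolveWithFastPath(String path) {
        if (hasGlob(path)) return resolveAlwaysList(path);
        int slash = path.lastIndexOf('/');
        List<String> out = new ArrayList<>();
        if (exists(path.substring(0, slash), path.substring(slash + 1))) out.add(path);
        return out;
    }

    public static void main(String[] args) {
        List<String> files = new ArrayList<>();
        for (int i = 0; i < 1000; i++) files.add("file" + i);
        DIRS.put("mydir", files);

        for (int i = 0; i < 100; i++) resolveAlwaysList("mydir/file" + i);
        System.out.println("always list: listCalls=" + listCalls + " statCalls=" + statCalls);

        listCalls = 0;
        statCalls = 0;
        for (int i = 0; i < 100; i++) resolveWithFastPath("mydir/file" + i);
        System.out.println("fast path:   listCalls=" + listCalls + " statCalls=" + statCalls);
    }
}
```

In the sketch, every literal path resolved the first way triggers a full listing of mydir,
which is the pattern the "dfs -get" workload above would produce against a large directory.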

> hadoop commands seem extremely slow in 0.20 branch
> --------------------------------------------------
>
>                 Key: HADOOP-5588
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5588
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs, fs
>    Affects Versions: 0.20.0
>         Environment: 0.20-branch and trunk
>            Reporter: Koji Noguchi
>            Assignee: Hairong Kuang
>            Priority: Blocker
>             Fix For: 0.20.0
>
>
> hadoop dfs -get/rm/mkdir/etc   mydir/fileA mydir/fileB mydir/fileC ...
> seem to be very slow in 0.20 branch. 
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

