hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Allen Wittenauer (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HDFS-1049) utility to list all files less than X replication
Date Sat, 20 Mar 2010 01:04:27 GMT

    [ https://issues.apache.org/jira/browse/HDFS-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847676#action_12847676

Allen Wittenauer commented on HDFS-1049:

This goes back to the whole "traversing the file system" issue that impacts lots of other
tools and common ops procedures. It takes *forever* on a large fs plus adds to the NN load
when we could just have the NN spit out the answer to the question directly.  [This would
be great functionality for the mythical hadoop dfs -find command.]

> utility to list all files less than X replication
> -------------------------------------------------
>                 Key: HDFS-1049
>                 URL: https://issues.apache.org/jira/browse/HDFS-1049
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: tools
>    Affects Versions: 0.20.2
>            Reporter: Allen Wittenauer
> It would be great to have a utility that lists all files that have a replication less
than X.  While fsck provides this output and it isn't that tricky to parse, it would still
be nice if Hadoop had this functionality out of the box.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message