hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Johan Oskarson (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-369) Added ability to copy all part-files into one output file
Date Wed, 19 Jul 2006 11:58:17 GMT
     [ http://issues.apache.org/jira/browse/HADOOP-369?page=all ]

Johan Oskarson updated HADOOP-369:

    Attachment: dircat.patch

I've added two patches, one is the requested cat feature (cat whole directory)

However, a simple test shows that this is very very much slower then saving it through a filestream.
Why I do not know :)

So I've changed the copymerge patch as suggested and uploaded the new patch.

> Added ability to copy all part-files into one output file
> ---------------------------------------------------------
>                 Key: HADOOP-369
>                 URL: http://issues.apache.org/jira/browse/HADOOP-369
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: dfs
>    Affects Versions: 0.4.0
>            Reporter: Johan Oskarson
>            Priority: Trivial
>         Attachments: copymerge.patch, copymerge.patch, dircat.patch
> Since we use the hadoop output in non-hadoop applications it's nice to be able to merge
the part-files into one output file on the local filesystem.
> So I've added a dfsshell feature that streams from all files in a directory to one output

This message is automatically generated by JIRA.
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message