hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "eric baldeschwieler (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-2501) Implement utility-tools for working with SequenceFiles
Date Tue, 22 Jan 2008 07:08:34 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12561233#action_12561233
] 

eric baldeschwieler commented on HADOOP-2501:
---------------------------------------------

It would be great if we had a way of catting out the key/values from a sequence file in the
same format consumed by streaming.

It would be good if such a tool could take start and end offsets, so it could be used with
splits.




> Implement utility-tools for working with SequenceFiles
> ------------------------------------------------------
>
>                 Key: HADOOP-2501
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2501
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: io
>            Reporter: Arun C Murthy
>            Assignee: Enis Soztutar
>
> It would be nice to implement a bunch of utilities to work with SequenceFiles:
>  * info (print-out header information such as key/value types, compression type/codec
etc.)
>  * cat
>  * head/tail
>  * merge multiple seq-files into one
>  * ...
> I'd imagine this would look like:
> {noformat}
> $ bin/hadoop seq -info /user/joe/blah.seq
> $ bin/hadoop seq -head -n 10 /user/joe/blah.seq
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message