hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3315) New binary file format
Date Mon, 28 Apr 2008 06:39:55 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12592749#action_12592749
] 

Alejandro Abdelnur commented on HADOOP-3315:
--------------------------------------------

I would like to see something like Andrzej is mentioning, the metadata section at the end.

Our use case is specifically the example hi mentions, the record count.

Currently keep the count in a shadow file, ie _NAME.counter, using a custom SequenceOutputFormat.
The problem with this approach is that we are doubling the number of files in HDFS for SequenceFiles.




> New binary file format
> ----------------------
>
>                 Key: HADOOP-3315
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3315
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: io
>            Reporter: Owen O'Malley
>
> SequenceFile's block compression format is too complex and requires 4 codecs to compress
or decompress. It would be good to have a file format that only needs 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message