hadoop-common-dev mailing list archives

From "Johan Oskarsson (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-1434) Let users add compression types
Date Fri, 25 May 2007 17:14:16 GMT
Let users add compression types
-------------------------------

                 Key: HADOOP-1434
                 URL: https://issues.apache.org/jira/browse/HADOOP-1434
             Project: Hadoop
          Issue Type: Improvement
          Components: mapred
            Reporter: Johan Oskarsson
            Priority: Minor


This is probably a special case, but we're considering serving data directly from the generated
sequence files to avoid having to convert to another file format.

However, using block compression means we'd have to read up to almost 1 MB (the default block
size) of data to find a single record. Our records are so small that compressing
them using record compression increases the size of the file compared to no compression.


I'd like to make a modified version of the BlockCompressWriter that ends a block based on
properties of the key being appended.
There's currently no easy way of doing this without modifying SequenceFile directly.
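
To illustrate the idea, here is a minimal sketch of a writer that closes a block when a pluggable policy says so, based on the key being appended. It is not Hadoop code and does not use SequenceFile's on-disk format: the names BlockBoundaryPolicy and KeyAwareBlockWriter are hypothetical, keys and values are plain strings, and java.util.zip.Deflater stands in for whatever codec would actually be configured.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.Deflater;

// Hypothetical hook: decides, from the key about to be appended, whether
// the current block should be closed before that key is written.
interface BlockBoundaryPolicy {
    boolean startsNewBlock(String previousKey, String nextKey);
}

// Toy stand-in for a block-compressing writer (assumption: not the real
// BlockCompressWriter and not SequenceFile's format).
class KeyAwareBlockWriter {
    private final BlockBoundaryPolicy policy;
    private final ByteArrayOutputStream pending = new ByteArrayOutputStream();
    private final List<byte[]> blocks = new ArrayList<>();
    private String previousKey = null;

    KeyAwareBlockWriter(BlockBoundaryPolicy policy) {
        this.policy = policy;
    }

    void append(String key, String value) {
        // Ask the policy before buffering the record, so the new key
        // becomes the first record of the next block.
        if (previousKey != null && policy.startsNewBlock(previousKey, key)) {
            flushBlock();
        }
        byte[] record = (key + "\t" + value + "\n").getBytes(StandardCharsets.UTF_8);
        pending.write(record, 0, record.length);
        previousKey = key;
    }

    void close() {
        flushBlock();
    }

    // Compressed blocks in the order they were closed.
    List<byte[]> blocks() {
        return blocks;
    }

    private void flushBlock() {
        if (pending.size() == 0) {
            return; // nothing buffered, so no empty block is emitted
        }
        Deflater deflater = new Deflater();
        deflater.setInput(pending.toByteArray());
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        while (!deflater.finished()) {
            int n = deflater.deflate(buf);
            out.write(buf, 0, n);
        }
        deflater.end();
        blocks.add(out.toByteArray());
        pending.reset();
    }
}
```

For example, constructing the writer with the policy (prev, next) -> prev.charAt(0) != next.charAt(0) would keep all keys sharing a first character in one block, so a lookup only needs to decompress the block for that prefix.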

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

