hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Koji Noguchi (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-3460) SequenceFileAsBinaryOutputFormat
Date Wed, 28 May 2008 20:19:46 GMT
SequenceFileAsBinaryOutputFormat
--------------------------------

                 Key: HADOOP-3460
                 URL: https://issues.apache.org/jira/browse/HADOOP-3460
             Project: Hadoop Core
          Issue Type: New Feature
          Components: mapred
            Reporter: Koji Noguchi
            Priority: Minor


Add an OutputFormat to write raw bytes as keys and values to a SequenceFile.

In C++-Pipes, we're using SequenceFileAsBinaryInputFormat to read Sequencefiles.
However, we current don't have a way to *write* a sequencefile efficiently without going through
extra (de)serializations.

I'd like to store the correct classnames for key/values but use BytesWritable to write
(in order for the next java or pig code to be able to read this sequencefile).


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message