hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Koji Noguchi (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-3460) SequenceFileAsBinaryOutputFormat
Date Wed, 28 May 2008 20:19:46 GMT

                 Key: HADOOP-3460
                 URL: https://issues.apache.org/jira/browse/HADOOP-3460
             Project: Hadoop Core
          Issue Type: New Feature
          Components: mapred
            Reporter: Koji Noguchi
            Priority: Minor

Add an OutputFormat to write raw bytes as keys and values to a SequenceFile.

In C++-Pipes, we're using SequenceFileAsBinaryInputFormat to read Sequencefiles.
However, we current don't have a way to *write* a sequencefile efficiently without going through
extra (de)serializations.

I'd like to store the correct classnames for key/values but use BytesWritable to write
(in order for the next java or pig code to be able to read this sequencefile).

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message