hadoop-common-dev mailing list archives

From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-2834) Iterator for MapFileOutputFormat
Date Tue, 22 Apr 2008 21:53:22 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley updated HADOOP-2834:
----------------------------------

    Status: Open  (was: Patch Available)

I'm sorry, but you've got a few accidental whitespace diffs.

I also think it would be a very good idea for your Reader.next to validate that the types of
the key and value passed in match the types in the sequence file. Otherwise, the
user can accidentally cross the streams and get random data corruption and runtime errors.
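
A minimal sketch of the kind of check suggested above, not the patch's actual code. The class TypedReader, its constructor, and checkTypes are hypothetical stand-ins; a real reader would presumably take the expected classes from the file it opened and run the check at the start of next(key, value).

```java
/** Hypothetical stand-in for a reader that knows the key/value classes recorded in its file. */
class TypedReader {
    private final Class<?> keyClass;    // assumed to come from the file's metadata
    private final Class<?> valueClass;  // assumed to come from the file's metadata

    TypedReader(Class<?> keyClass, Class<?> valueClass) {
        this.keyClass = keyClass;
        this.valueClass = valueClass;
    }

    /** Fails fast when the caller's objects do not match the recorded types. */
    void checkTypes(Object key, Object value) {
        if (key.getClass() != keyClass) {
            throw new IllegalArgumentException("wrong key class: "
                + key.getClass().getName() + " is not " + keyClass.getName());
        }
        if (value.getClass() != valueClass) {
            throw new IllegalArgumentException("wrong value class: "
                + value.getClass().getName() + " is not " + valueClass.getName());
        }
    }
}
```

Rejecting the mismatch up front turns a silent mis-deserialization into an immediate, descriptive error.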

> Iterator for MapFileOutputFormat
> --------------------------------
>
>                 Key: HADOOP-2834
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2834
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.17.0
>            Reporter: Andrzej Bialecki 
>            Assignee: Andrzej Bialecki 
>         Attachments: map-file-v2.patch, map-file-v3.patch, map-file-v4.patch
>
>
> MapFileOutputFormat produces output data that is sorted locally in each part-NNNNN file -
> however, there is no easy way to iterate over keys from all parts in a globally ascending
> order.
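
One common way to get a globally ascending order from several locally sorted parts is a k-way merge over a priority queue. The sketch below is illustrative only and is not taken from the attached patches: MergedIterator and Head are hypothetical names, and plain in-memory iterators stand in for per-part readers.

```java
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.PriorityQueue;

/** Merges several individually sorted iterators into one globally ascending iterator. */
final class MergedIterator<K extends Comparable<K>> implements Iterator<K> {

    /** One heap entry per part: that part's current key plus the remainder of the part. */
    private static final class Head<K extends Comparable<K>> implements Comparable<Head<K>> {
        final K key;
        final Iterator<K> rest;

        Head(K key, Iterator<K> rest) {
            this.key = key;
            this.rest = rest;
        }

        @Override
        public int compareTo(Head<K> other) {
            return key.compareTo(other.key);
        }
    }

    private final PriorityQueue<Head<K>> heap = new PriorityQueue<>();

    MergedIterator(List<Iterator<K>> parts) {
        // Seed the heap with the first key of every non-empty part.
        for (Iterator<K> part : parts) {
            if (part.hasNext()) {
                heap.add(new Head<>(part.next(), part));
            }
        }
    }

    @Override
    public boolean hasNext() {
        return !heap.isEmpty();
    }

    @Override
    public K next() {
        Head<K> smallest = heap.poll();
        if (smallest == null) {
            throw new NoSuchElementException();
        }
        // Refill the heap from the part that just supplied the smallest key.
        if (smallest.rest.hasNext()) {
            heap.add(new Head<>(smallest.rest.next(), smallest.rest));
        }
        return smallest.key;
    }
}
```

With k parts, each call to next() costs O(log k), and only one pending key per part is held in memory at a time.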

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.