hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrzej Bialecki (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-2834) Iterator for MapFileOutputFormat
Date Wed, 19 Mar 2008 16:00:26 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580422#action_12580422
] 

Andrzej Bialecki  commented on HADOOP-2834:
-------------------------------------------

I'm working on an updated patch, and I just realized that we can only iterate over entries
(using RawKeyValueIterator), and we can't support some other methods in the Reader abstraction,
methods that are available in other readers such as seek ... We can only implement next(),
get(), reset() and close(). If people feel that this still falls under the Reader rather than
Iterator I'll complete the patch that implements it.

> Iterator for MapFileOutputFormat
> --------------------------------
>
>                 Key: HADOOP-2834
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2834
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.17.0
>            Reporter: Andrzej Bialecki 
>            Assignee: Andrzej Bialecki 
>             Fix For: 0.17.0
>
>         Attachments: map-file-v2.patch, map-file-v3.patch
>
>
> MapFileOutputFormat produces output data that is sorted locally in each part-NNNNN file
- however, there is no easy way to iterate over keys from all parts in a globally ascending
order.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message