hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joydeep Sen Sarma (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-939) No-sort optimization
Date Sat, 04 Aug 2007 00:26:52 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12517672

Joydeep Sen Sarma commented on HADOOP-939:

am a new hadoop user - i am looking into how much of a warehouse type infrastructure can be
implemented over hadoop. in some cases - i would like to have the flexibility of having the
output partitioned by the mapoutput key - but i am not interested in it being sorted. the
directmapoutputcollector takes away sorting - but it also takes away partitioning. a lighter
hammer would be useful imho.

> No-sort optimization
> --------------------
>                 Key: HADOOP-939
>                 URL: https://issues.apache.org/jira/browse/HADOOP-939
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: mapred
>         Environment: all
>            Reporter: Doug Judd
> There should be a way to tell the mapred framework that the output of the map() phase
will already be sorted.  The Reduce phase can just merge the intermediate files together without

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message