hadoop-mapreduce-issues mailing list archives

From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-3310) Custom grouping comparator cannot be set for Combiners
Date Fri, 15 Nov 2013 04:15:29 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alejandro Abdelnur updated MAPREDUCE-3310:

    Attachment: MAPREDUCE-3310-trunk.patch

The test failure seems unrelated.

Uploading a patch that fixes the javac warning (which was in a test case).

> Custom grouping comparator cannot be set for Combiners
> ------------------------------------------------------
>                 Key: MAPREDUCE-3310
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3310
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: client
>    Affects Versions: 0.20.1
>         Environment: All
>            Reporter: Mathias Herberts
>            Assignee: Alejandro Abdelnur
>         Attachments: MAPREDUCE-3310-trunk.patch, MAPREDUCE-3310-trunk.patch
> Combiners are often described as 'Reducers running on the Map side'.
> Like Reducers, Combiners are fed <K,{V}>, where {V} is built by grouping the values associated with the 'same' key.
> For Reducers, the comparator used for grouping values can be set independently of the one used to sort the keys (using Job.setGroupingComparatorClass).
> No such configuration is possible for Combiners, so some things that can be done in Reducers (such as secondary sort) cannot be done in Combiners.
> It would be handy to have a Job.setCombinerGroupingComparatorClass method that sets the grouping comparator used when applying a Combiner.
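
To illustrate why a grouping comparator that differs from the sort comparator matters, here is a minimal plain-Java sketch. It uses no Hadoop classes, and every name in it (GroupingSketch, Key, SORT, GROUP, group) is hypothetical and chosen only for illustration. It assumes a composite key made of a natural key plus a secondary field, with the secondary field standing in for the values. Keys are sorted on both fields but grouped on the natural key alone, which is the secondary-sort pattern the description refers to.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Plain-Java illustration only: no Hadoop classes, all names are hypothetical.
final class GroupingSketch {

    // Composite key: a natural key plus a secondary field used for ordering.
    static final class Key {
        final String natural;
        final int secondary;

        Key(String natural, int secondary) {
            this.natural = natural;
            this.secondary = secondary;
        }
    }

    // Sort comparator: orders by natural key, then by the secondary field.
    static final Comparator<Key> SORT =
        Comparator.comparing((Key k) -> k.natural).thenComparingInt(k -> k.secondary);

    // Grouping comparator: keys with equal natural keys count as the 'same' key.
    static final Comparator<Key> GROUP = Comparator.comparing((Key k) -> k.natural);

    // Sorts the keys, then opens a new group whenever the grouping comparator
    // says the current key differs from the previous one.
    static List<List<Integer>> group(List<Key> keys,
                                     Comparator<Key> sort,
                                     Comparator<Key> grouping) {
        List<Key> sorted = new ArrayList<>(keys);
        sorted.sort(sort);
        List<List<Integer>> groups = new ArrayList<>();
        Key previous = null;
        for (Key k : sorted) {
            if (previous == null || grouping.compare(previous, k) != 0) {
                groups.add(new ArrayList<>());
            }
            groups.get(groups.size() - 1).add(k.secondary);
            previous = k;
        }
        return groups;
    }

    public static void main(String[] args) {
        List<Key> keys = List.of(
            new Key("b", 2), new Key("a", 3), new Key("a", 1), new Key("b", 1));
        // Separate grouping comparator: one ordered group per natural key.
        System.out.println(group(keys, SORT, GROUP)); // prints [[1, 3], [1, 2]]
        // Grouping with the sort comparator: every distinct key is its own group.
        System.out.println(group(keys, SORT, SORT));  // prints [[1], [3], [1], [2]]
    }
}
```

When the sort comparator is also used for grouping, each composite key ends up in its own group. According to the description above, Combiners cannot be given a separate grouping comparator, which is why secondary sort is not possible in them.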

This message was sent by Atlassian JIRA