hadoop-common-dev mailing list archives

From "Chris Douglas (JIRA)" <j...@apache.org>
Subject [jira] Issue Comment Edited: (HADOOP-3226) Run combiner when merging spills from map output
Date Tue, 22 Apr 2008 18:51:22 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591403#action_12591403 ]

chris.douglas edited comment on HADOOP-3226 at 4/22/08 11:49 AM:
-----------------------------------------------------------------

This patch runs the combiner on the reduce-side spills. It also runs the combiner
during the map-side merge if there are more than min.num.spills.for.combine spills
(6 by default). It adds no new test cases because it changes no behavior and should
be covered by the existing mapred test cases.
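
To illustrate the threshold behavior described above, here is a minimal sketch in Java.
It is not taken from the patch: the class SpillMergeSketch, the merge method, and the
in-memory list-of-records model of a spill are all hypothetical, the combiner is assumed
to be a simple sum, and "more than" is read literally as a strict comparison. A real
merge would stream sorted runs from disk rather than group everything in memory.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; none of these names come from the Hadoop source.
class SpillMergeSketch {
    // Assumed default, mirroring the "6 by default" stated in the comment.
    static final int MIN_SPILLS_FOR_COMBINE = 6;

    // Each spill is modeled as a list of (key, count) records.
    // Returns the merged records in key order. The combiner (assumed here to be
    // a sum) is applied only when the number of spills exceeds the threshold.
    static List<Map.Entry<String, Integer>> merge(
            List<List<Map.Entry<String, Integer>>> spills, int minSpillsForCombine) {
        // Group values by key; TreeMap keeps the keys sorted, as a merge would.
        TreeMap<String, List<Integer>> grouped = new TreeMap<>();
        for (List<Map.Entry<String, Integer>> spill : spills) {
            for (Map.Entry<String, Integer> rec : spill) {
                grouped.computeIfAbsent(rec.getKey(), k -> new ArrayList<>())
                       .add(rec.getValue());
            }
        }

        // Strict comparison is an assumption based on the wording "more than".
        boolean combine = spills.size() > minSpillsForCombine;

        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            if (combine) {
                // Combiner: collapse all values for this key into one record.
                int sum = 0;
                for (int v : e.getValue()) {
                    sum += v;
                }
                out.add(new AbstractMap.SimpleEntry<>(e.getKey(), sum));
            } else {
                // Too few spills: pass every record through unchanged.
                for (int v : e.getValue()) {
                    out.add(new AbstractMap.SimpleEntry<>(e.getKey(), v));
                }
            }
        }
        return out;
    }
}
```

With seven single-record spills for the same key, this sketch emits one combined record;
with two such spills it emits both records unchanged, since the combiner is skipped.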

      was (Author: chris.douglas):
    This patch adds a run of the combiner to the reduce-side spills. It also runs the combiner
on the map side merge if there are more than min.num.spills.for.combine (6 by default).
  
> Run combiner when merging spills from map output
> ------------------------------------------------
>
>                 Key: HADOOP-3226
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3226
>             Project: Hadoop Core
>          Issue Type: Improvement
>            Reporter: Chris Douglas
>             Fix For: 0.18.0
>
>         Attachments: 3226-0.patch, 3226-1.patch
>
>
> When merging spills from the map, running the combiner should further diminish the volume
> of data we send to the reduce.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

