Date: Thu, 6 Jun 2013 12:52:20 +0000 (UTC)
From: "Josh Wills (JIRA)"
To: crunch-dev@incubator.apache.org
Reply-To: dev@crunch.apache.org
Subject: [jira] [Updated] (CRUNCH-162) Add utility function for merging output by identity reduce


     [ https://issues.apache.org/jira/browse/CRUNCH-162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Wills updated CRUNCH-162:
------------------------------

    Attachment: CRUNCH-162.patch

A patch for this that uses the Aggregate lib and getDetachedValue to do the shard/unshard operation.

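For readers unfamiliar with the pattern, the shard/unshard idea can be sketched in plain Java without any Crunch or Hadoop dependency. This is not the attached patch and does not use the Aggregate lib or getDetachedValue; the class name IdentityReduceSketch and the methods shard and unshard are hypothetical names chosen for illustration, and round-robin shard assignment is an assumption. Each record is tagged with a shard number (the "map" side), then every group is emitted unchanged (the identity "reduce" side), so the number of output groups is bounded by the number of shards rather than by the number of inputs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative simulation only: hypothetical names, no Crunch or Hadoop APIs.
public class IdentityReduceSketch {

    // "Map" side: assign each record to a shard key (round-robin is an assumption).
    static <T> Map<Integer, List<T>> shard(List<T> records, int numShards) {
        Map<Integer, List<T>> shards = new TreeMap<>();
        for (int i = 0; i < records.size(); i++) {
            shards.computeIfAbsent(i % numShards, k -> new ArrayList<>())
                  .add(records.get(i));
        }
        return shards;
    }

    // "Reduce" side: identity. Values pass through unchanged and the shard key
    // is dropped; each inner list stands in for one merged output file.
    static <T> List<List<T>> unshard(Map<Integer, List<T>> shards) {
        List<List<T>> outputFiles = new ArrayList<>();
        for (List<T> values : shards.values()) {
            outputFiles.add(new ArrayList<>(values));
        }
        return outputFiles;
    }

    public static void main(String[] args) {
        List<String> records = Arrays.asList("a", "b", "c", "d", "e");
        // Five input records are merged into two output groups.
        System.out.println(unshard(shard(records, 2)));
    }
}
```

In a real job the shard count would correspond to the number of reducers, which is what controls how many output files are written.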
> Add utility function for merging output by identity reduce
> ----------------------------------------------------------
>
>                 Key: CRUNCH-162
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-162
>             Project: Crunch
>          Issue Type: Improvement
>          Components: MapReduce Patterns
>    Affects Versions: 0.4.0
>            Reporter: Dave Beech
>            Priority: Minor
>         Attachments: CRUNCH-162.patch
>
>
> Something I find myself doing reasonably often in mapreduce is to use
> the reduce step as nothing more than a means to merge data into larger
> files (using the identity reducer).
> There doesn't appear to be a neat way to do this with Crunch at the moment.
> Ref: http://mail-archives.apache.org/mod_mbox/incubator-crunch-user/201302.mbox/%3CCAFZSZPsXRxWT45c9w4ef7Ruij2exE4HP2CDNMjd%2BVc%3D9RWX-Jw%40mail.gmail.com%3E

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira