Return-Path: Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: (qmail 80133 invoked from network); 9 Jun 2010 00:28:05 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 9 Jun 2010 00:28:05 -0000 Received: (qmail 63566 invoked by uid 500); 9 Jun 2010 00:28:04 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 63525 invoked by uid 500); 9 Jun 2010 00:28:04 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 63517 invoked by uid 99); 9 Jun 2010 00:28:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Jun 2010 00:28:04 +0000 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=AWL,RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.219.215] (HELO mail-ew0-f215.google.com) (209.85.219.215) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Jun 2010 00:27:59 +0000 Received: by ewy7 with SMTP id 7so1178080ewy.31 for ; Tue, 08 Jun 2010 17:27:38 -0700 (PDT) MIME-Version: 1.0 Received: by 10.213.21.201 with SMTP id k9mr12590958ebb.31.1276043258135; Tue, 08 Jun 2010 17:27:38 -0700 (PDT) Received: by 10.213.9.129 with HTTP; Tue, 8 Jun 2010 17:27:38 -0700 (PDT) In-Reply-To: References: Date: Wed, 9 Jun 2010 02:27:38 +0200 Message-ID: Subject: Re: multiple outputs From: Torsten Curdt To: mapreduce-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Can the MultipleOutputs also be used inside a mapper? So basically I pipe data into different reducers from the mapper. Of course I could do two separate jobs but that would very inefficient as I would have to go/read through all the data twice. cheers -- Torsten On Tue, Jun 8, 2010 at 06:22, Amareshwari Sri Ramadasu wrote: > MultipleOutputs is ported to use new api through > http://issues.apache.org/jira/browse/MAPREDUCE-370 > See the discussions on jira and javadoc/testcase as an example on how to use > it. > > Thanks > Amareshwari > > On 6/7/10 8:08 PM, "Torsten Curdt" wrote: > > I need to emit to different output files from a reducer. > > The old API had MultipleSequenceFileOutputFormat. > Am I missing something or is this gone in the new API? > > Are there any problems porting this over? > Or does it just needs to be done? > > cheers > -- > Torsten > >