hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Torsten Curdt <tcu...@vafer.org>
Subject Re: multiple outputs
Date Wed, 09 Jun 2010 00:27:38 GMT
Can the MultipleOutputs also be used inside a mapper?

So basically I pipe data into different reducers from the mapper.

Of course I could do two separate jobs but that would very inefficient
as I would have to go/read through all the data twice.

cheers
--
Torsten

On Tue, Jun 8, 2010 at 06:22, Amareshwari Sri Ramadasu
<amarsri@yahoo-inc.com> wrote:
> MultipleOutputs is ported to use new api through
> http://issues.apache.org/jira/browse/MAPREDUCE-370
> See the discussions on jira and javadoc/testcase as an example on how to use
> it.
>
> Thanks
> Amareshwari
>
> On 6/7/10 8:08 PM, "Torsten Curdt" <tcurdt@apache.org> wrote:
>
> I need to emit to different output files from a reducer.
>
> The old API had MultipleSequenceFileOutputFormat.
> Am I missing something or is this gone in the new API?
>
> Are there any problems porting this over?
> Or does it just needs to be done?
>
> cheers
> --
> Torsten
>
>

Mime
View raw message