hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rahul Bhattacharjee <rahul.rec....@gmail.com>
Subject Re: Assigning the same partition number to the mapper output
Date Fri, 14 Jun 2013 09:43:11 GMT
Some flexibility is there when it comes to changing the name of the output.
Check out MultipleOutputs

Never used it with a map only job.


On Thu, Jun 13, 2013 at 8:33 AM, Maysam Yabandeh <m.yabandeh@gmail.com>wrote:

> Hi,
> I was wondering if it is possible in hadoop to assign the same partition
> numbers to the map outputs. I am running a map-only job (with zero
> reducers) and hadoop shuffles the partitions in the output: i.e.
> input/part-m-0000X is processed by task number Y and hence generates
> output/part-m-0000Y (where X != Y).
> Thanks
> Maysam

View raw message