hadoop-common-user mailing list archives

From Doug Cutting <cutt...@apache.org>
Subject Re: Number of Reduce Outputs
Date Tue, 29 Aug 2006 17:01:25 GMT
To generate a single output file, specify just a single reduce task.  If 
your reducer isn't doing much computation, it might be faster to do this 
in the original job; otherwise, use a subsequent job.

Doug
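
As a rough illustration of why a single reduce task yields a single file:
each reduce task writes its own part-xxxxx file, and the default
HashPartitioner assigns a key to a task by hashing it modulo the number of
reduce tasks. The sketch below is a toy model in plain Java, not Hadoop
code; the class and method names (ReduceOutputSketch, partition,
outputFiles) are hypothetical. In a real job the count would be set with
JobConf.setNumReduceTasks(1).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Toy model of reduce output files; hypothetical names, not part of Hadoop. */
final class ReduceOutputSketch {

    /** Same formula as Hadoop's default HashPartitioner: hash modulo task count. */
    static int partition(String key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    /** Maps each output file name to the keys its reduce task would receive. */
    static Map<String, List<String>> outputFiles(List<String> keys, int numReduceTasks) {
        Map<String, List<String>> files = new TreeMap<>();
        // Every reduce task writes one file, even if it receives no keys.
        for (int task = 0; task < numReduceTasks; task++) {
            files.put(String.format("part-%05d", task), new ArrayList<>());
        }
        for (String key : keys) {
            String name = String.format("part-%05d", partition(key, numReduceTasks));
            files.get(name).add(key);
        }
        return files;
    }

    public static void main(String[] args) {
        List<String> keys = Arrays.asList("apple", "banana", "cherry", "date");
        // Ten reduce tasks: ten part files, keys spread across them.
        System.out.println(outputFiles(keys, 10).keySet());
        // One reduce task: every key lands in part-00000.
        System.out.println(outputFiles(keys, 1));
    }
}
```

With one reduce task, every key maps to partition 0, so all records pass
through a single reducer. That is why this only pays off in the original
job when the reducer is cheap.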

Dennis Kubes wrote:
> This is probably a simple question, but when I run my MR job I am getting 
> 10 splits and therefore 10 output files like part-xxxxx.  Is there a way 
> to merge those outputs into a single file using the currently running MR 
> job, or do I need to run another MR job to merge them?
> 
> Dennis Kubes
