hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bejoy Ks <bejoy.had...@gmail.com>
Subject Re: Number of Maps running more than expected
Date Fri, 17 Aug 2012 09:36:55 GMT
Hi Gaurav

How many input files are there for the wordcount map reduce job? Do you
have input files lesser than a block size? If you are using the default
TextInputFormat there will be one task generated per file for sure, so if
you have  files less than block size the calculation specified here for
number of splits won't hold. If small files are there then definitely the
number of maps tasks should be more.

Also did you change the split sizes as well along with block size?

Bejoy KS

View raw message