hadoop-common-user mailing list archives

From Nick Jones <nick.jo...@amd.com>
Subject Uneven DBInputFormat Splits
Date Wed, 10 Mar 2010 14:07:21 GMT
Hi all,
I've set up a job that pulls, say, 250 records from MySQL and splits them 
across several mappers.  Each mapper (with the exception of attempt_*_0) 
gets roughly 250/(n mappers) records.  However, attempt 0 always ends up 
with ~5x the workload of the others.  Is there something I'm missing or 
is this normal?


Nick Jones
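
For reference, the even distribution the message expects (roughly 250/(n mappers) records each) can be sketched as below. This is only an illustration of the arithmetic, not Hadoop's DBInputFormat code; the class name SplitSketch and the method evenSplitSizes are hypothetical.

```java
import java.util.Arrays;

public class SplitSketch {
    // Hypothetical helper, not part of Hadoop: returns the sizes of n
    // contiguous chunks that together cover `total` rows as evenly as possible.
    public static long[] evenSplitSizes(long total, int n) {
        long[] sizes = new long[n];
        long base = total / n;   // rows every chunk receives
        long extra = total % n;  // leftover rows, handed out one per chunk
        for (int i = 0; i < n; i++) {
            sizes[i] = base + (i < extra ? 1 : 0);
        }
        return sizes;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(evenSplitSizes(250, 5))); // [50, 50, 50, 50, 50]
        System.out.println(Arrays.toString(evenSplitSizes(250, 4))); // [63, 63, 62, 62]
    }
}
```

Under this scheme no chunk differs from another by more than one row, so one attempt carrying ~5x the records of the others would not be explained by rounding alone.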
