hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Terry Healy <the...@bnl.gov>
Subject TT nodes distributed cache failure
Date Fri, 25 Jan 2013 17:48:39 GMT
Running hadoop-0.20.2 on a 20 node cluster.

When running a Map/Reduce job that uses several .jars loaded into the
Distributed cache, several (~4) nodes have their map jobs fails because
of ClassNotFoundException. All the other nodes proceed through the job
normally and the jobs completes. But this is wasting 20-25% of my TT nodes.

Can anyone explain why some nodes might fail to read all the .jars from
the Distributed cache?

Thanks

Mime
View raw message