hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From - <commodor...@ymail.com>
Subject How / when does On-disk merge work?
Date Fri, 25 Oct 2013 19:35:40 GMT
Hi All,

Can anyone provide documentation regarding how on-disk merge on reduce phase works in detail
in Hadoop 2.2.0?
There is an explanation in this page but I am afraid it could be outdated since what I observe
in my log files is a bunch of "OnDiskMerger - Thread to merge on-disk map-outputs" work at
the end of merge phase.

Thanks,
-
Mime
View raw message