hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ravi Prakash <ravi...@ymail.com>
Subject Re: How / when does On-disk merge work?
Date Mon, 28 Oct 2013 14:15:36 GMT

Tom White's "Hadoop: The Definitive Guide" is probably the best source for information on
this (apart from the code itself ;-) Look at MergeManagerImpl.java btw in case you are so


On Friday, October 25, 2013 2:36 PM, - <commodore65@ymail.com> wrote:
Hi All,

Can anyone provide documentation regarding how on-disk merge on reduce phase works in detail
in Hadoop 2.2.0?
There is an explanation in this page but I am afraid it could be outdated since what I observe
in my log files is a bunch of "OnDiskMerger - Thread to merge on-disk map-outputs" work at
the end of merge phase.

View raw message