hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Douglas <cdoug...@apache.org>
Subject Re: How does map-merge work exactly?
Date Wed, 19 Sep 2012 23:21:21 GMT
On Tue, Sep 18, 2012 at 7:02 AM, Martin Dobmeier
<martin.dobmeier@gmail.com> wrote:
> Ah, alright. But why is Hadoop telling me that there are 117 segments given
> that only 96 reducers have been configured?
> (btw, I'm using Hadoop 1.0.0)

There were 117 spills, so the merger starts with 117 files, does an
intermediate merge of 54 segments (#reducers = 96 times), then a final
merge of 64 segments (96 times). All of those layers produce log

> So the merger is called "number of reducers" times because it combines the
> data for a particular reducer which is spread over all spill files, right?

Yup, you have it. -C

View raw message