Subject: Re: issue about Shuffled Maps in MR job summary
From: Adam Kawa <kawa.adam@gmail.com>
To: user@hadoop.apache.org
Date: Wed, 11 Dec 2013 23:30:03 +0100

> why sometimes increasing the reducer number will not decrease the job completion time?

Apart from the valid information that Yong wrote in the previous post, please note that:

1) You do not want very short-lived (seconds) reduce tasks, because the overhead of coordinating them, starting JVMs, and setting up the connections to all map tasks becomes too costly. It depends on your use case, but MapReduce jobs are usually for batch processing, and at my company we set the number of reduce tasks so that each task runs for at least a couple of minutes (for production jobs that are scheduled in the "background", we aim for ~10 minutes).

2) When you have more reduce tasks, you need more slots (or containers, if you use YARN). Sometimes you cannot get slots/containers as quickly as you want, so you can get stuck waiting for more resources. Then the job completion time grows.

3) If you have thinner reducers, they probably write smaller output files to HDFS. Small files are problematic for HDFS (e.g. higher memory requirements on the NameNode, bigger load on the NameNode, slower NameNode restarts, a more random than streaming access pattern, and more). If the output of that job is later processed by another job, you will also see thin mappers (though this can be partially alleviated with CombineFileInputFormat).

2013/12/11 java8964 <java8964@hotmail.com>

> The whole job completion time depends on a lot of factors. Are you sure the
> reducers are the bottleneck?
>
> It also depends on how many reducer input groups your MR job has. If you
> only have 20 reducer input groups, then even if you bump your reducer count
> to 40, the reduce phase won't change much, as the additional 20 reduce
> tasks won't get any data to process.
>
> If you have a lot of reducer input groups, your cluster has capacity at
> this time, and you also have a lot of idle reducer slots, then increasing
> your reducer count should decrease your whole job completion time.
>
> Make sense?
>
> Yong
>
> ------------------------------
> Date: Wed, 11 Dec 2013 14:20:24 +0800
> Subject: Re: issue about Shuffled Maps in MR job summary
> From: justlooks@gmail.com
> To: user@hadoop.apache.org
>
> i read the doc, and found that if i have 8 reducers, a map task will output
> 8 partitions, and each partition will be sent to a different reducer. so if
> i increase the reducer number, the partition number increases, but the
> volume of network traffic is the same. why does increasing the reducer
> number sometimes not decrease the job completion time?
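[The partitioning behavior described in the question above can be sketched with a toy model -- plain Python, not Hadoop's actual HashPartitioner, and the key names and record counts are made up for illustration. It shows that doubling the reducer count doubles the number of partitions each map task produces, while the total volume of records shuffled stays the same.]

```python
# Toy model of map-side partitioning: each map task splits its output into
# one partition per reducer, so the number of map->reduce transfers grows
# with the reducer count while the total data volume stays constant.

def partition(records, num_reducers):
    """Assign each (key, value) record to a reducer partition by key hash."""
    partitions = {r: [] for r in range(num_reducers)}
    for key, value in records:
        partitions[hash(key) % num_reducers].append((key, value))
    return partitions

# Hypothetical output of a single map task: 1000 records.
map_output = [(f"key{i}", 1) for i in range(1000)]

for reducers in (8, 16):
    parts = partition(map_output, reducers)
    total = sum(len(p) for p in parts.values())
    # More reducers -> more (smaller) partitions, same total volume.
    print(reducers, len(parts), total)  # 8 8 1000, then 16 16 1000
```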
> On Wed, Dec 11, 2013 at 1:48 PM, Vinayakumar B <vinayakumar.b@huawei.com> wrote:
>
> It looks simple :)
>
> Shuffled Maps = Number of Map Tasks * Number of Reducers
>
> Thanks and Regards,
> Vinayakumar B
>
> *From:* ch huang [mailto:justlooks@gmail.com]
> *Sent:* 11 December 2013 10:56
> *To:* user@hadoop.apache.org
> *Subject:* issue about Shuffled Maps in MR job summary
>
> hi, maillist:
>
> i run terasort with 16 reducers and with 8 reducers. when i double the
> reducer number, Shuffled Maps also doubles. my question is: the job only
> runs 20 map tasks (the total input is 10 files, each file is 100M, my
> block size is 64M, so the split count is 20). why do i need to shuffle
> 160 maps in the 8-reducer run and 320 maps in the 16-reducer run? how to
> calculate the Shuffled Maps number?
>
> 16 reducer summary output:
>
> Shuffled Maps = 320
>
> 8 reducer summary output:
>
> Shuffled Maps = 160
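[The counters quoted above line up with Vinayakumar's formula. A quick sanity check of the arithmetic -- pure Python, no Hadoop involved, using the file/block sizes stated in the question:]

```python
# Sanity check of: Shuffled Maps = Number of Map Tasks * Number of Reducers.
# 10 input files of 100 MB each with a 64 MB block size give 2 splits per
# file (ceil(100 / 64) = 2), hence 20 map tasks.
files, file_mb, block_mb = 10, 100, 64

splits_per_file = -(-file_mb // block_mb)   # ceiling division: 2
map_tasks = files * splits_per_file         # 20

for reducers in (8, 16):
    shuffled_maps = map_tasks * reducers
    print(reducers, shuffled_maps)          # 8 160, then 16 320
```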