From: Adam Kawa <kawa.adam@gmail.com>
To: user@hadoop.apache.org
Date: Wed, 9 Jul 2014 16:01:42 +0200
Subject: Re: The number of simultaneous map tasks is unexpected.

Hi Tomek,

You have 9.26 GB across 4 nodes, which is 2.315 GB per node on average. What is your value of yarn.nodemanager.resource.memory-mb?

You consume 1 GB of RAM per container (8 containers running = 8 GB of memory used). My guess is that, after running 8 containers (1 AM + 7 map tasks), you have only about 315 MB of memory left on each NodeManager. Therefore, when you request 1 GB to get a container for the 8th map task, there is no single NodeManager that can give you a whole 1 GB (despite the cluster having more than 1 GB of aggregate memory free).

To verify this, please check the value of yarn.nodemanager.resource.memory-mb.
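For reference, here is a minimal sketch, not taken from this thread, that reads the effective NodeManager memory limit through the standard YarnConfiguration API and works out how many 1 GB containers fit on one node; the 1024 MB per-container request is an assumption based on the numbers discussed above:

```java
import org.apache.hadoop.yarn.conf.YarnConfiguration;

public class NodeManagerMemoryCheck {
    public static void main(String[] args) {
        // Loads core-site.xml / yarn-site.xml from the classpath, like any Hadoop client.
        YarnConfiguration conf = new YarnConfiguration();

        // yarn.nodemanager.resource.memory-mb (8192 MB if left at the default).
        int nmMemoryMb = conf.getInt(YarnConfiguration.NM_PMEM_MB,
                                     YarnConfiguration.DEFAULT_NM_PMEM_MB);

        int containerMb = 1024; // assumed per-container request, as in this thread
        System.out.println("yarn.nodemanager.resource.memory-mb  = " + nmMemoryMb);
        System.out.println("1 GB containers per NodeManager      = " + (nmMemoryMb / containerMb));
        System.out.println("leftover memory per NodeManager (MB) = " + (nmMemoryMb % containerMb));
    }
}
```

If the printed limit is around 2370 MB (9.26 GB spread over 4 nodes), each NodeManager fits exactly two 1 GB containers with roughly 320 MB left over, which matches the 8 running containers and the stuck 8th map task described above.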
Thanks,
Adam

PS1. Just out of curiosity, what are your values of
*yarn.nodemanager.resource.cpu-vcores* (isn't it 2?)
*yarn.resourcemanager.scheduler.class* (I assume the Fair Scheduler, but just to confirm. Do you have any non-default settings in your scheduler's configuration that limit the amount of resources per user?)
*yarn.nodemanager.linux-container-executor.resources-handler.class*
?

PS2. "I am comparing the M/R implementation with a custom one, where one node is dedicated for coordination and I utilize 4 slaves fully for computation."
Note that this might not work at a larger scale, because the "one node dedicated for coordination" might become the bottleneck. This is one of the reasons why YARN and the original MapReduce at Google decided to run coordination processes on the slave nodes.


2014-07-09 9:47 GMT+02:00 Tomasz Guziałek <tomasz@guzialek.info>:

> Thank you for your assistance, Adam.
>
> Containers running | Memory used | Memory total | Memory reserved
>                  8 |        8 GB |      9.26 GB |             0 B
>
> It seems you are right: the ApplicationMaster is occupying one slot, as I
> have 8 containers running but only 7 map tasks.
>
> Also, I revised my information about the m1.large instance on EC2. There
> are only 2 cores available per node, giving 4 compute units (ECUs, the
> units introduced by Amazon). So 8 slots at a time is expected. However,
> scheduling the AM on a slave node ruins my experiment. I am comparing the
> M/R implementation with a custom one, where one node is dedicated to
> coordination and I utilize the 4 slaves fully for computation. This one
> core used by the AM is extending the execution time by a factor of 2.
> Does anyone have an idea how to get 8 map tasks running?
>
> Pozdrawiam / Regards / Med venlig hilsen
> Tomasz Guziałek
>
>
> 2014-07-09 0:56 GMT+02:00 Adam Kawa <kawa.adam@gmail.com>:
>
>> If you run an application (e.g. a MapReduce job) on a YARN cluster,
>> first the ApplicationMaster is started on some slave node to coordinate
>> the execution of all tasks within the job. The ApplicationMaster and the
>> tasks that belong to its application run in containers controlled by the
>> NodeManagers.
>>
>> Maybe you simply run 8 containers on your YARN cluster, and 1 container
>> is consumed by the MapReduce ApplicationMaster while 7 containers are
>> consumed by map tasks. But that does not seem to be the root cause of
>> your problem, because according to your settings you should be able to
>> run at most 16 containers.
>>
>> Another idea might be that you are bottlenecked by the amount of memory
>> on the cluster (each container consumes memory), so despite having
>> vcore(s) available, you cannot launch new tasks. When you go to the
>> ResourceManager Web UI, do you see that you are utilizing the whole
>> cluster memory?
>>
>>
>>
>> 2014-07-08 21:06 GMT+02:00 Tomasz Guziałek <tomasz@guzialek.info>:
>>
>>> I was not precise when describing my cluster. I have 4 slave nodes and
>>> a separate master node. The master has the ResourceManager role (along
>>> with the JobHistory role) and the rest have NodeManager roles. If this
>>> really is an ApplicationMaster, is it possible to schedule it on the
>>> master node? This single waiting map task is doubling my execution time.
>>>
>>> Pozdrawiam / Regards / Med venlig hilsen
>>> Tomasz Guziałek
>>>
>>>
>>> 2014-07-08 18:42 GMT+02:00 Adam Kawa <kawa.adam@gmail.com>:
>>>
>>>> Isn't your MapReduce AppMaster occupying one slot?
>>>>
>>>> Sent from my iPhone
>>>>
>>>> > On 8 Jul 2014, at 13:01, Tomasz Guziałek <tomaszguzialek@gmail.com>
>>>> > wrote:
>>>> >
>>>> > Hello all,
>>>> >
>>>> > I am running a 4-node CDH5 cluster on Amazon EC2. The instances used
>>>> > are m1.large, so I have 4 cores (2 cores x 2 units) per node. My
>>>> > HBase table has 8 regions, so I expected at least 8 (if not 16)
>>>> > mapper tasks to run simultaneously.
>>>> > However, only 7 are running and 1 is waiting for an empty slot. Why
>>>> > did this surprising number come up? I have checked that the regions
>>>> > are equally distributed across the region servers (2 per node).
>>>> >
>>>> > My properties in the job:
>>>> > Configuration mapReduceConfiguration = HBaseConfiguration.create();
>>>> > mapReduceConfiguration.set("hbase.client.max.perregion.tasks", "4");
>>>> > mapReduceConfiguration.set("mapreduce.tasktracker.map.tasks.maximum", "16");
>>>> >
>>>> > My properties in the CDH:
>>>> > yarn.scheduler.minimum-allocation-vcores = 1
>>>> > yarn.scheduler.maximum-allocation-vcores = 4
>>>> >
>>>> > Am I missing some property? Please share your experience.
>>>> >
>>>> > Best regards
>>>> > Tomasz
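As a side note, here is a minimal, self-contained sketch of how a map-only HBase scan job like the one quoted above is typically wired up on YARN; the table name, mapper, and memory value are illustrative assumptions and not taken from the thread. (On MRv2/YARN, concurrency is bounded by container resources such as mapreduce.map.memory.mb rather than by the MRv1 TaskTracker slot count.)

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapreduce.Job;

public class RegionScanJob {

    // Hypothetical mapper: TableInputFormat produces one split (one map task) per region.
    static class RegionMapper extends TableMapper<NullWritable, NullWritable> {
        @Override
        protected void map(ImmutableBytesWritable row, Result value, Context context) {
            // Per-row work would go here.
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        // On MRv2 the per-map container request is what the scheduler checks
        // against yarn.nodemanager.resource.memory-mb on each node.
        conf.set("mapreduce.map.memory.mb", "1024"); // illustrative value

        Job job = Job.getInstance(conf, "hbase-region-scan");
        job.setJarByClass(RegionScanJob.class);

        Scan scan = new Scan();
        scan.setCaching(500);       // batch more rows per RPC for MR scans
        scan.setCacheBlocks(false); // do not pollute the block cache from MR

        TableMapReduceUtil.initTableMapperJob(
                "my_table",           // hypothetical table name
                scan,
                RegionMapper.class,
                NullWritable.class,   // map output key class
                NullWritable.class,   // map output value class
                job);
        job.setNumReduceTasks(0);     // map-only job

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```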