From: Tomasz Guziałek
Date: Wed, 9 Jul 2014 09:47:41 +0200
Subject: Re: The number of simultaneous map tasks is unexpected.
To: user@hadoop.apache.org

Thank you for your assistance, Adam.

Containers running | Memory used | Memory total | Memory reserved
                 8 |        8 GB |      9.26 GB |             0 B

It seems you are right: the ApplicationMaster is occupying one slot, since I have 8 containers running but only 7 map tasks.

I also revised my information about the m1.large instance type on EC2. There are only 2 cores per node, giving 4 compute units (the ECU units introduced by Amazon), so 8 slots at a time is expected. However, scheduling the AM on a slave node ruins my experiment: I am comparing the MapReduce implementation with a custom one in which one node is dedicated to coordination and the 4 slaves are used fully for computation. This one core taken by the AM extends the execution time by a factor of 2. Does anyone have an idea how to get 8 map tasks running?
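For reference, a minimal sketch of the job-side memory settings that decide how many containers fit per node. The property names are the standard MRv2/YARN ones; the values are illustrative only and assume roughly 2.3 GB of NodeManager memory per node (9.26 GB across 4 nodes) and a cluster-side minimum allocation that permits containers smaller than 1 GB:

    // Sketch only, with illustrative values: request smaller containers so that
    // 9 of them (8 map tasks + 1 ApplicationMaster) fit into ~9.26 GB of memory.
    Configuration mapReduceConfiguration = HBaseConfiguration.create();
    mapReduceConfiguration.set("mapreduce.map.memory.mb", "768");           // memory requested per map container
    mapReduceConfiguration.set("mapreduce.map.java.opts", "-Xmx614m");      // JVM heap, roughly 80% of the container
    mapReduceConfiguration.set("yarn.app.mapreduce.am.resource.mb", "768"); // ApplicationMaster container size
    // Requests are rounded up to yarn.scheduler.minimum-allocation-mb, and each node
    // is capped by yarn.nodemanager.resource.memory-mb; both are cluster-side
    // (yarn-site.xml / Cloudera Manager) settings that cannot be overridden from the job.

With 768 MB containers each node could hold three instead of two, so the ApplicationMaster would no longer displace a map task.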
Pozdrawiam / Regards / Med venlig hilsen
Tomasz Guziałek


2014-07-09 0:56 GMT+02:00 Adam Kawa <kawa.adam@gmail.com>:

> If you run an application (e.g. a MapReduce job) on a YARN cluster, first the
> ApplicationMaster is started on some slave node to coordinate the execution
> of all tasks within the job. The ApplicationMaster and the tasks that belong
> to its application run in containers controlled by the NodeManagers.
>
> Maybe you simply run 8 containers on your YARN cluster: 1 container is
> consumed by the MapReduce ApplicationMaster and 7 containers are consumed by
> map tasks. But that does not seem to be the root cause of your problem,
> because according to your settings you should be able to run at most 16
> containers.
>
> Another idea might be that you are bottlenecked by the amount of memory on
> the cluster (each container consumes memory), so despite having vcore(s)
> available you cannot launch new tasks. When you go to the ResourceManager
> Web UI, do you see that you are using the whole cluster memory?
>
>
> 2014-07-08 21:06 GMT+02:00 Tomasz Guziałek <tomasz@guzialek.info>:
>
>> I was not precise when describing my cluster. I have 4 slave nodes and a
>> separate master node. The master has the ResourceManager role (along with
>> the JobHistory role) and the rest have NodeManager roles. If this really is
>> an ApplicationMaster, is it possible to schedule it on the master node?
>> This single waiting map task is doubling my execution time.
>>
>> Pozdrawiam / Regards / Med venlig hilsen
>> Tomasz Guziałek
>>
>>
>> 2014-07-08 18:42 GMT+02:00 Adam Kawa <kawa.adam@gmail.com>:
>>
>>> Isn't your MapReduce AppMaster occupying one slot?
>>>
>>> Sent from my iPhone
>>>
>>> > On 8 jul 2014, at 13:01, Tomasz Guziałek <tomaszguzialek@gmail.com> wrote:
>>> >
>>> > Hello all,
>>> >
>>> > I am running a 4-node CDH5 cluster on Amazon EC2. The instances used are
>>> > m1.large, so I have 4 cores (2 cores x 2 units) per node. My HBase table
>>> > has 8 regions, so I expected at least 8 (if not 16) mapper tasks to run
>>> > simultaneously. However, only 7 are running and 1 is waiting for an empty
>>> > slot. Why did this surprising number come up? I have checked that the
>>> > regions are equally distributed across the region servers (2 per node).
>>> >
>>> > My properties in the job:
>>> > Configuration mapReduceConfiguration = HBaseConfiguration.create();
>>> > mapReduceConfiguration.set("hbase.client.max.perregion.tasks", "4");
>>> > mapReduceConfiguration.set("mapreduce.tasktracker.map.tasks.maximum", "16");
>>> >
>>> > My properties in the CDH:
>>> > yarn.scheduler.minimum-allocation-vcores = 1
>>> > yarn.scheduler.maximum-allocation-vcores = 4
>>> >
>>> > Am I missing some property? Please share your experience.
>>> >
>>> > Best regards
>>> > Tomasz
>>
>
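One clarification on the configuration quoted above: mapreduce.tasktracker.map.tasks.maximum is an MRv1 (TaskTracker) property and has no effect on a YARN cluster, where per-node concurrency is bounded by the NodeManager's memory and vcore budget instead. Below is a minimal sketch for inspecting those bounds from the client side; it assumes the client's yarn-site.xml mirrors the cluster configuration, and the class name is made up for illustration:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.yarn.conf.YarnConfiguration;

    public class ShowYarnLimits {
        public static void main(String[] args) {
            // Loads yarn-default.xml plus the client's yarn-site.xml from the classpath;
            // the printed values only match the cluster if that file is kept in sync.
            Configuration conf = new YarnConfiguration();
            System.out.println("NM memory per node (MB): "
                    + conf.get("yarn.nodemanager.resource.memory-mb"));
            System.out.println("NM vcores per node:      "
                    + conf.get("yarn.nodemanager.resource.cpu-vcores"));
            System.out.println("Min allocation (MB):     "
                    + conf.get("yarn.scheduler.minimum-allocation-mb"));
            // Containers per node is roughly min(NM memory / mapreduce.map.memory.mb,
            // NM vcores / mapreduce.map.cpu.vcores); with ~2.3 GB per node and 1 GB
            // map containers that gives 2 per node, i.e. 8 containers in total.
        }
    }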