From: George Ioannidis <giorgioath@gmail.com>
Date: Thu, 16 Apr 2015 20:03:14 +0200
Subject: Re: Pin Map/Reduce tasks to specific cores
To: user@hadoop.apache.org

Dear Rohith and Naga,

Thank you very much for your quick responses; your information has proven very useful.

Cheers,
George

On 7 April 2015 at 07:08, Naganarasimha G R (Naga) <garlanaganarasimha@huawei.com> wrote:

> Hi George,
>
> The current implementation in YARN uses cgroups and supports CPU isolation, but not by pinning to specific cores (cgroup cpusets); it works on CPU cycles instead (quota and period).
> The admin can specify what percentage of the CPU may be used by YARN containers, and YARN takes care of configuring the cgroup quota and period files, ensuring that YARN containers use no more than the configured CPU percentage.
>
> Is there any particular need to pin the MR tasks to specific cores, or do you just want to ensure YARN is not using more than the specified percentage of CPU on a given node?
>
> Regards,
> Naga
>
> ------------------------------
> *From:* Rohith Sharma K S [rohithsharmaks@huawei.com]
> *Sent:* Tuesday, April 07, 2015 09:23
> *To:* user@hadoop.apache.org
> *Subject:* RE: Pin Map/Reduce tasks to specific cores
>
> Hi George,
>
> In MRv2, YARN supports a cgroups implementation. Using cgroups, it is possible to run containers on specific cores.
>
> For detailed reference, some useful links:
>
> http://dev.hortonworks.com.s3.amazonaws.com/HDPDocuments/HDP2/HDP-2-trunk/bk_system-admin-guide/content/ch_cgroups.html
> http://blog.cloudera.com/blog/2013/12/managing-multiple-resources-in-hadoop-2-with-yarn/
> http://riccomini.name/posts/hadoop/2013-06-14-yarn-with-cgroups/
>
> P.S.: I could not find any related document in the Hadoop YARN docs; I will raise a ticket for this in the community.
>
> Hope the above information helps your use case!
>
> Thanks & Regards,
> Rohith Sharma K S
>
> *From:* George Ioannidis [mailto:giorgioath@gmail.com]
> *Sent:* 07 April 2015 01:55
> *To:* user@hadoop.apache.org
> *Subject:* Pin Map/Reduce tasks to specific cores
>
> Hello. My question, which can also be found on Stack Overflow, concerns pinning map/reduce tasks to specific cores, on either Hadoop v1.2.1 or Hadoop v2.
>
> Specifically, I would like to know whether the end user has any control over which core executes a given map/reduce task.
>
> To pin an application on Linux there is the "taskset" command, but does Hadoop provide anything similar? If not, is the Linux scheduler in charge of allocating tasks to specific cores?
>
> ------------------
>
> Below are two cases to better illustrate my question:
>
> *Case #1:* 2 GiB input size, an HDFS block size of 64 MiB, and 2 compute nodes available, with 32 cores each.
>
> As a result, 32 map tasks will be launched; suppose mapred.tasktracker.map.tasks.maximum = 16, so 16 map tasks will be allocated to each node.
>
> Can I guarantee that each map task will run on a specific core, or is that up to the Linux scheduler?
>
> ------------------
>
> *Case #2:* The same as case #1, but now the input size is 8 GiB, so there are not enough slots for all 128 map tasks, and multiple tasks will share the same cores.
>
> Can I control how much "time" each task will spend on a specific core, and whether it will be reassigned to the same core in the future?
>
> Any information on the above would be highly appreciated.
>
> Kind Regards,
> George
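[Editor's note: the CPU-percentage enforcement Naga describes is configured on the NodeManager in yarn-site.xml. A minimal sketch, assuming Hadoop 2.x with the LinuxContainerExecutor; the 80% value is purely illustrative:]

```xml
<!-- Sketch: yarn-site.xml cgroups CPU settings (Hadoop 2.x, illustrative values). -->
<property>
  <!-- Use the cgroups-based resource handler of the LinuxContainerExecutor. -->
  <name>yarn.nodemanager.linux-container-executor.resources-handler.class</name>
  <value>org.apache.hadoop.yarn.server.nodemanager.util.CgroupsLCEResourcesHandler</value>
</property>
<property>
  <!-- Cap all YARN containers combined at 80% of the node's physical CPU. -->
  <name>yarn.nodemanager.resource.percentage-physical-cpu-limit</name>
  <value>80</value>
</property>
<property>
  <!-- Enforce the cap via cgroup quota/period at all times,
       not only when there is contention from other processes. -->
  <name>yarn.nodemanager.linux-container-executor.cgroups.strict-resource-usage</name>
  <value>true</value>
</property>
```

[This matches the quota/period mechanism in Naga's reply: YARN limits how much CPU time containers get, but does not bind them to particular cores.]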
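[Editor's note: Hadoop itself exposes no taskset-style knob, but for completeness, the OS-level pinning George asks about can be done from Linux. A minimal sketch using Python's standard-library wrappers around the Linux sched_setaffinity(2) syscall, which is what `taskset` uses under the hood; the core number 0 is illustrative:]

```python
import os

# Inspect which cores this process may currently run on
# (the affinity mask that `taskset -p <pid>` would print).
allowed = os.sched_getaffinity(0)  # 0 means "the calling process"
print("allowed cores before pinning:", sorted(allowed))

# Pin the calling process to core 0 only, like `taskset -cp 0 <pid>`.
# Child processes forked afterwards inherit this affinity mask.
os.sched_setaffinity(0, {0})
print("allowed cores after pinning:", sorted(os.sched_getaffinity(0)))
```

[Note this is Linux-only and per-process: it does not integrate with Hadoop's task scheduling, which is why the cgroups quota/period mechanism described in the replies is what YARN actually offers.]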