From: Sandy Ryza <sandy.ryza@cloudera.com>
To: user@hadoop.apache.org
Date: Tue, 2 Jul 2013 13:04:44 -0700
Subject: Re: Containers and CPU

That's correct.

-Sandy

On Tue, Jul 2, 2013 at 12:28 PM, John Lilley <john.lilley@redpoint.net> wrote:

> Sandy,
>
> Thanks, I think I understand. So it only makes a difference if cgroups is
> on AND the AM requests multiple cores? E.g. if each task wants 4 cores the
> RM would only allow two containers per 8-core node?
>
> John
>
> From: Sandy Ryza [mailto:sandy.ryza@cloudera.com]
> Sent: Tuesday, July 02, 2013 1:26 PM
> To: user@hadoop.apache.org
> Subject: Re: Containers and CPU
>
> Use of cgroups for controlling CPU is off by default, but can be turned on
> as a nodemanager configuration with
> yarn.nodemanager.linux-container-executor.resources-handler.class. So it
> is site-wide. If you want tasks to purely fight it out in the OS thread
> scheduler, simply don't change from the default.
>
> Even with cgroups on, all tasks will have access to all CPU cores. We
> don't do any pinning of tasks to cores. If a task is requested with a
> single vcore and placed on an otherwise empty machine with 8 cores, it will
> have access to all 8 cores. If 3 other tasks that requested a single vcore
> are later placed on the same node, and all tasks are using as much CPU as
> they can get their hands on, then each of the tasks will get 2 cores of
> CPU-time.
>
> On Tue, Jul 2, 2013 at 12:12 PM, John Lilley <john.lilley@redpoint.net> wrote:
>
> Sandy,
>
> Sorry, I don't completely follow.
>
> When you say "with cgroups on", is that an attribute of the AM, the
> Scheduler, or the Site/RM? In other words is it site-wide or something
> that my application can control?
>
> With cgroups on, is there still a way to get my desired behavior? I'd
> really like all tasks to have access to all CPU cores and simply fight it
> out in the OS thread scheduler.
>
> Thanks,
>
> john
>
> From: Sandy Ryza [mailto:sandy.ryza@cloudera.com]
> Sent: Tuesday, July 02, 2013 11:56 AM
> To: user@hadoop.apache.org
> Subject: Re: Containers and CPU
>
> CPU limits are only enforced if cgroups is turned on. With cgroups on,
> they are only limited when there is contention, in which case tasks are
> given CPU time in proportion to the number of cores requested for/allocated
> to them. Does that make sense?
>
> -Sandy
>
> On Tue, Jul 2, 2013 at 9:50 AM, Chuan Liu <chuanliu@microsoft.com> wrote:
>
> I believe this is the default behavior.
>
> By default, only memory limit on resources is enforced.
>
> The capacity scheduler will use DefaultResourceCalculator to compute
> resource allocation for containers by default, which also does not take CPU
> into account.
>
> -Chuan
>
> From: John Lilley [mailto:john.lilley@redpoint.net]
> Sent: Tuesday, July 02, 2013 8:57 AM
> To: user@hadoop.apache.org
> Subject: Containers and CPU
>
> I have YARN tasks that benefit from multicore scaling. However, they
> don't *always* use more than one core. I would like to allocate
> containers based only on memory, and let each task use as many cores as
> needed, without allocating exclusive CPU "slots" in the scheduler. For
> example, on an 8-core node with 16GB memory, I'd like to be able to run 3
> tasks each consuming 4GB memory and each using as much CPU as they like.
> Is this the default behavior if I don't specify CPU restrictions to the
> scheduler?
>
> Thanks
>
> John
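A minimal yarn-site.xml sketch of the nodemanager configuration described above, using the property names documented for Hadoop 2.x. It assumes the rest of the LinuxContainerExecutor setup (the setuid container-executor binary, yarn.nodemanager.linux-container-executor.group, and so on) is already in place and omits it here:

  <!-- yarn-site.xml: switch the nodemanager to the LinuxContainerExecutor
       and its cgroups resources handler so that vcore allocations are
       enforced through cgroups. Leaving both at their defaults keeps CPU
       enforcement off, which is the behavior John is asking for. -->
  <property>
    <name>yarn.nodemanager.container-executor.class</name>
    <value>org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor</value>
  </property>
  <property>
    <name>yarn.nodemanager.linux-container-executor.resources-handler.class</name>
    <value>org.apache.hadoop.yarn.server.nodemanager.util.CgroupsLCEResourcesHandler</value>
  </property>

Because this is a nodemanager setting, it applies to every container on that node; it is not something an individual application master can opt in or out of, which is why Sandy calls it site-wide.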
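Chuan's point about DefaultResourceCalculator is the other half of the picture: with the default calculator, the capacity scheduler packs containers by memory alone and ignores vcores when deciding how many fit on a node. If you instead want the scheduler to count CPU, so that an 8-vcore node is limited to two 4-vcore containers as in John's example, a sketch of the capacity-scheduler.xml change (assuming the property name from the Hadoop 2.x CapacityScheduler documentation) looks like this:

  <!-- capacity-scheduler.xml: make the CapacityScheduler consider CPU as
       well as memory when placing containers. With the default
       DefaultResourceCalculator, only memory constrains placement. -->
  <property>
    <name>yarn.scheduler.capacity.resource-calculator</name>
    <value>org.apache.hadoop.yarn.util.resource.DominantResourceCalculator</value>
  </property>

Note that this scheduling change is independent of cgroups: the resource calculator decides how many containers fit on a node, while the cgroups handler decides whether each container's CPU share is enforced once it is running.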