From: Ravi Prakash <ravihadoop@gmail.com>
Date: Thu, 10 Nov 2016 10:26:03 -0800
Subject: Re: Yarn 2.7.3 - capacity scheduler container allocation to nodes?
To: Rafał Radecki <radecki.rafal@gmail.com>
Cc: Bibinchundatt <bibin.chundatt@huawei.com>, user <user@hadoop.apache.org>

Is there a reason you want that behavior? I'm not sure you can get it
easily. Here's a link to the code that may be coming into play (depending
on your configuration):
https://github.com/apache/hadoop/blob/branch-2.7.3/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/LeafQueue.java#L1372

On Thu, Nov 10, 2016 at 1:57 AM, Rafał Radecki <radecki.rafal@gmail.com> wrote:

> I have already used maximum-capacity for both queues (70 and 30) to limit
> their resource usage, but it seems that this mechanism works at the
> cluster level rather than the node level.
> We have Samza tasks on the cluster and they run for a very long time, so
> we cannot depend on the elasticity mechanism.
>
> 2016-11-10 10:31 GMT+01:00 Bibinchundatt <bibin.chundatt@huawei.com>:
>
>> Hi Rafał,
>>
>> Probably the following two options are what you can look into:
>>
>> 1. Elasticity - free resources can be allocated to any queue beyond its
>> capacity. When there is demand for these resources from queues running
>> below capacity at a future point in time, as tasks scheduled on these
>> resources complete, they will be assigned to applications on queues
>> running below capacity (preemption is not supported). This ensures that
>> resources are available to queues in a predictable and elastic manner,
>> preventing artificial silos of resources in the cluster, which helps
>> utilization.
>>
>> http://hadoop.apache.org/docs/r2.7.2/hadoop-yarn/hadoop-yarn-site/CapacityScheduler.html
>>
>> yarn.scheduler.capacity.<queue-path>.maximum-capacity
>> Maximum queue capacity in percentage (%) as a float. This limits the
>> elasticity for applications in the queue. Defaults to -1, which
>> disables it.
>>
>> 2. Preemption of containers.
>>
>> Regards
>> Bibin
>>
>> From: Rafał Radecki [mailto:radecki.rafal@gmail.com]
>> Sent: 10 November 2016 17:26
>> To: Bibinchundatt
>> Cc: Ravi Prakash; user
>> Subject: Re: Yarn 2.7.3 - capacity scheduler container allocation to nodes?
>>
>> We have 4 nodes and 4 large tasks (~30GB each); additionally we have
>> about 25 small tasks (~2GB each). The tasks may be started in any order.
>> On each node we have 50GB for YARN. So if we start all 4 large tasks
>> first, they are correctly scheduled across all 4 nodes.
>> But if we first start all the short tasks, they all go to the first
>> cluster node, leaving no free capacity on it. When we then try to start
>> the 4 large tasks, we only have resources on the remaining 3 nodes
>> available and cannot start one of the large tasks.
>>
>> BR,
>> Rafal.
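For reference, a minimal capacity-scheduler.xml sketch of the two-queue
setup discussed in this thread. The property names are from the Capacity
Scheduler documentation linked above; the queue names ("long", "short") and
the 70/30 split are taken from the thread itself, so treat the values as an
illustration rather than a tested configuration:

    <configuration>
      <property>
        <name>yarn.scheduler.capacity.root.queues</name>
        <value>long,short</value>
      </property>
      <!-- Guaranteed share of cluster resources for each queue. -->
      <property>
        <name>yarn.scheduler.capacity.root.long.capacity</name>
        <value>70</value>
      </property>
      <property>
        <name>yarn.scheduler.capacity.root.short.capacity</name>
        <value>30</value>
      </property>
      <!-- maximum-capacity caps elasticity: with these values a queue can
           never grow beyond its guaranteed share, even when the other
           queue is idle. As noted above, the caps are enforced against
           the cluster as a whole, not per node. -->
      <property>
        <name>yarn.scheduler.capacity.root.long.maximum-capacity</name>
        <value>70</value>
      </property>
      <property>
        <name>yarn.scheduler.capacity.root.short.maximum-capacity</name>
        <value>30</value>
      </property>
    </configuration>

Bibin's second option, container preemption, is typically enabled
separately in yarn-site.xml by setting
yarn.resourcemanager.scheduler.monitor.enable to true and configuring the
ProportionalCapacityPreemptionPolicy as the scheduler monitor policy.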
>> 2016-11-10 9:54 GMT+01:00 Bibinchundatt <bibin.chundatt@huawei.com>:
>>
>> Hi Rafał!
>>
>> "Is there a way to force yarn to use the configured thresholds (70% and
>> 30%) per node?"
>>
>> - Currently we can't specify thresholds per node.
>>
>> As per your initial mail, YARN memory per node is ~50GB, which means all
>> nodes' resources are the same. Is there a specific use case for per-node
>> allocation based on percentage?
>>
>> From: Rafał Radecki [mailto:radecki.rafal@gmail.com]
>> Sent: 10 November 2016 14:59
>> To: Ravi Prakash
>> Cc: user
>> Subject: Re: Yarn 2.7.3 - capacity scheduler container allocation to nodes?
>>
>> Hi Ravi.
>>
>> I did not specify labels this time ;) I just created two queues, as is
>> visible in the configuration.
>> Overall the queues work, but the allocation of jobs is different than I
>> expected, as I wrote at the beginning.
>>
>> BR,
>> Rafal.
>>
>> 2016-11-10 2:48 GMT+01:00 Ravi Prakash <ravihadoop@gmail.com>:
>>
>> Hi Rafał!
>>
>> Have you been able to launch the job successfully first without
>> configuring node labels? Do you really need node labels? How much total
>> memory do you have on the cluster? Node labels are usually for
>> specifying special capabilities of the nodes (e.g. some nodes could have
>> GPUs and your application could request to be run only on the nodes
>> which have GPUs).
>>
>> HTH
>> Ravi
>>
>> On Wed, Nov 9, 2016 at 5:37 AM, Rafał Radecki <radecki.rafal@gmail.com> wrote:
>>
>> Hi All.
>>
>> I have a 4-node cluster on which I run YARN. I created 2 queues, "long"
>> and "short", the first with 70% resource allocation, the second with
>> 30%. Both queues are configured on all available nodes by default.
>>
>> My memory for YARN per node is ~50GB. Initially I thought that when I
>> run tasks in the "short" queue, YARN would allocate them across all
>> nodes using 30% of the memory on every node. So for example, if I run 20
>> tasks of 2GB each (40GB in total) in the short queue:
>>
>> - the first ~7 will be scheduled on node1 (14GB total; 30% of the 50GB
>>   available on this node for the "short" queue -> 15GB)
>> - the next ~7 tasks will be scheduled on node2
>> - the remaining ~6 tasks will be scheduled on node3
>> - YARN on node4 will not use any resources assigned to the "short" queue.
>>
>> But this does not seem to be the case. At the moment I see that all
>> tasks are started on node1 and the other nodes have no tasks started.
>>
>> I attached my yarn-site.xml and capacity-scheduler.xml.
>>
>> Is there a way to force YARN to use the thresholds configured above (70%
>> and 30%) per node and not for the cluster as a whole? I would like a
>> configuration in which, on every node, 70% is always available for the
>> "long" queue and 30% for the "short" queue, and any resources that are
>> free for a particular queue are not used by the other queue. Is it
>> possible?
>>
>> BR,
>> Rafal.
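Since per-node queue thresholds are not supported, node labels (which Ravi
asks about above) are the closest mechanism this YARN version offers for
tying a queue to particular machines. A rough sketch of what that could
look like; the label name "shortnodes", the hostname "node1", and the HDFS
store path are all made-up examples, and the rmadmin syntax varies slightly
across 2.x releases (check `yarn rmadmin -help` for your version). First,
in yarn-site.xml:

    <property>
      <name>yarn.node-labels.enabled</name>
      <value>true</value>
    </property>
    <property>
      <name>yarn.node-labels.fs-store.root-dir</name>
      <value>hdfs://namenode:8020/yarn/node-labels</value>
    </property>

Then register the label and attach it to a node:

    yarn rmadmin -addToClusterNodeLabels shortnodes
    yarn rmadmin -replaceLabelsOnNode "node1=shortnodes"

Finally, give the "short" queue access to the label in
capacity-scheduler.xml:

    <property>
      <name>yarn.scheduler.capacity.root.short.accessible-node-labels</name>
      <value>shortnodes</value>
    </property>
    <property>
      <name>yarn.scheduler.capacity.root.short.accessible-node-labels.shortnodes.capacity</name>
      <value>100</value>
    </property>

Note that an exclusive label (the default) dedicates whole nodes to the
queues that can access it, rather than giving every queue a fixed
percentage on every node, so this only approximates the behavior asked for
in this thread.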