From: Yehia Elshater <y.z.elshater@gmail.com>
To: user@hadoop.apache.org
Date: Wed, 13 Aug 2014 13:06:17 -0400
Subject: Re: fair scheduler not working as intended

Hi Henry,

I think FairScheduler is a better option for your case. If you used FifoScheduler, the latency of the short jobs would be worse whenever any of the longrun jobs is running in your cluster. So I think your queue layout is a good way to apply fairness between the long and short jobs. The longrun queue will gain more resources (more than its min share) only when the short queue has no jobs to run in the meantime. Those additional resources are gradually given back, down to its minimum share again, depending on the short queue's workload, to ensure fairness.
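
(An illustrative aside: how quickly the longrun queue gives resources back depends on preemption, which is off by default in the FairScheduler; without it the queue only shrinks as its running containers finish. The property and element names below are taken from memory of the Hadoop 2.x FairScheduler documentation, so treat them as assumptions and verify against 2.2.0.)

  <!-- yarn-site.xml: allow the FairScheduler to reclaim containers
       from queues that are over their share -->
  <property>
    <name>yarn.scheduler.fair.preemption</name>
    <value>true</value>
  </property>

  <!-- fair-scheduler.xml: preempt once a queue has been below its
       min share for 60 seconds -->
  <defaultMinSharePreemptionTimeout>60</defaultMinSharePreemptionTimeout>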

I hope that helps you.

Yehia


On 13 August 2014 02:01, Henry Hung <YTHung1@winbond.com> wrote:

Hi Yehia,


Oh? I thought that by using maxResources = 15360 mb (3072 mb * 5), vcores = 5, and maxMaps = 5, I was already restricting the job to use only 5 maps at most.


The reason is that my long run job has 841 maps, and each map will process data for almost 2 hours.

In the meantime there will be some short jobs that only need a couple of minutes to complete.

That is why I use the fair scheduler to split resources into 2 groups, one default and the other one longrun.

I want to make sure there are always available resources ready to be used by the short jobs.


If your explanation is true, then the current fair scheduler behavior is not what I wanted.

So is there any other way to set up YARN resources to accommodate the short / long run jobs?
Or do I need to create 2 separate YARN clusters? (I have been thinking about this approach.)
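
(Illustrative sketch only, not something stated in this thread: one alternative to running a second cluster is to reserve a minimum share for the queue that serves the short jobs and to route jobs to queues explicitly. The element and property names below are assumed from the Hadoop 2.x FairScheduler and MapReduce documentation and should be double-checked against 2.2.0; the 15360 mb / 5 vcores figure is just an example value.)

  <!-- fair-scheduler.xml: guarantee capacity for the short-job queue -->
  <queue name="default">
    <minResources>15360 mb, 5 vcores</minResources>
  </queue>

  <!-- per-job configuration: send the long job to its own queue -->
  <property>
    <name>mapreduce.job.queuename</name>
    <value>longrun</value>
  </property>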


Best regards,

Henry


From: Yehia Elshater [mailto:y.z.elshater@gmail.com]
Sent: Wednesday, August 13, 2014 11:27 AM
To: user@hadoop.apache.org
Subject: Re: fair scheduler not working as intended


Hi Henry,


Are there any applications (on queues other than the longrun queue) running at the same time? I think FairScheduler is going to assign more resources to your "longrun" queue as long as there are no other applications running in the other queues.


Thanks

Yehia


On 12 August 2014 20:30, Henry Hung <YTHung1@winbond.com> wrote:

Hi Everyone,


I'm using Hadoop 2.2.0 with the fair scheduler in my YARN cluster, but something is wrong with the fair scheduler.


Here is what my fair-scheduler.xml looks like:


<allocations>
  <queue name="longrun">
    <maxResources>15360 mb, 5 vcores</maxResources>
    <weight>0.5</weight>
    <minMaps>2</minMaps>
    <maxMaps>5</maxMaps>
    <minReduces>1</minReduces>
  </queue>
</allocations>
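
(A side note, offered as an assumption to verify rather than something stated in this thread: minMaps, maxMaps, and minReduces look like elements carried over from the MRv1 fair scheduler, and as far as I can tell the YARN FairScheduler in 2.2.0 does not recognize them; it just logs a warning and ignores them, so the only effective cap here would be maxResources. A YARN-style allocation that also limits concurrent applications might look roughly like the following, with maxRunningApps set to 1 purely as an example value:)

<allocations>
  <queue name="longrun">
    <maxResources>15360 mb, 5 vcores</maxResources>
    <maxRunningApps>1</maxRunningApps>
    <weight>0.5</weight>
  </queue>
</allocations>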


I created a "longrun" queue to ensure that the huge MR application can only use 5 resources. In my YARN setup, each task's memory is 3072 MB:


  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>3072</value>
  </property>
  <property>
    <name>mapreduce.reduce.memory.mb</name>
    <value>3072</value>
  </property>
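
(Quick arithmetic, for illustration: with 3072 MB per map container and the queue capped at 15360 MB, at most 15360 / 3072 = 5 such containers fit at once. One caveat, stated as an assumption rather than a fact from this thread: the MapReduce ApplicationMaster container also runs in the same queue and counts against that cap; its size is controlled by a property along these lines, where 1536 MB is the usual default:)

  <property>
    <name>yarn.app.mapreduce.am.resource.mb</name>
    <value>1536</value>
  </property>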


When the huge application started, it worked just fine and the scheduler restricted it to only run 5 maps in parallel.

But after running for some time, the application ran 10 maps in parallel.

The scheduler page shows that the "longrun" queue used 66%, exceeding its fair share of 30%.


Can anyone tell me why the application can get more than it deserved?

Is the problem with my configuration? Or is there a bug?


Best regards,

Henry Hung


