From: Sandy Ryza <sandy.ryza@cloudera.com>
To: Jun Feng Liu <liujunf@cn.ibm.com>
Cc: "dev@spark.apache.org" <dev@spark.apache.org>, Patrick Wendell <pwendell@gmail.com>
Date: Fri, 8 Aug 2014 00:49:44 -0700
Subject: Re: Fine-Grained Scheduler on Yarn

I think that would be useful work. I don't know the minute details of this
code, but in general TaskSchedulerImpl keeps track of pending tasks. Tasks
are organized into TaskSets, each of which corresponds to a particular
stage. Each TaskSet has a TaskSetManager, which directly tracks the pending
tasks for that stage.
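A rough sketch of how those pieces fit together, using simplified,
hypothetical class shapes rather than the actual Spark internals (the real
TaskSchedulerImpl and TaskSetManager have many more responsibilities):

```scala
import scala.collection.mutable

// Hypothetical, stripped-down model of the structures described above.
case class Task(stageId: Int, partition: Int)

// One TaskSet per stage: the unit of work handed to the scheduler.
case class TaskSet(stageId: Int, tasks: Seq[Task])

// Tracks the pending (not yet launched) tasks for a single stage's TaskSet.
class TaskSetManager(val taskSet: TaskSet) {
  private val pending = mutable.Queue(taskSet.tasks: _*)
  def pendingCount: Int = pending.size
  def launchNext(): Option[Task] =
    if (pending.nonEmpty) Some(pending.dequeue()) else None
}

// The scheduler keeps one manager per active TaskSet; summing their pending
// counts gives one way to approximate the backlog discussed below.
class SimpleTaskScheduler {
  private val managers = mutable.ArrayBuffer.empty[TaskSetManager]
  def submitTasks(taskSet: TaskSet): Unit = managers += new TaskSetManager(taskSet)
  def backlog: Int = managers.map(_.pendingCount).sum
}
```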
-Sandy


On Fri, Aug 8, 2014 at 12:37 AM, Jun Feng Liu <liujunf@cn.ibm.com> wrote:

> Yes, I think we need resource control at both levels (container numbers
> and dynamically changing container resources), which can make resource
> utilization much more effective, especially when more types of workload
> share the same infrastructure.
>
> Is there any way I can observe the task backlog in the scheduler backend?
> It sounds like the scheduler backend is triggered when a new TaskSet is
> submitted, but I haven't figured out whether there is a way to check the
> whole backlog of tasks inside it. I am interested in implementing some
> policy in the scheduler backend and testing how useful it turns out to be.
>
> Best Regards
>
> Jun Feng Liu
> IBM China Systems & Technology Laboratory in Beijing
> Phone: 86-10-82452683  E-mail: liujunf@cn.ibm.com
> BLD 28, ZGC Software Park, No.8 Rd. Dong Bei Wang West, Dist. Haidian,
> Beijing 100193, China
>
>
> Sandy Ryza <sandy.ryza@cloudera.com>
> 2014/08/08 15:14
> To: Jun Feng Liu/China/IBM@IBMCN
> Cc: Patrick Wendell <pwendell@gmail.com>, "dev@spark.apache.org" <dev@spark.apache.org>
> Subject: Re: Fine-Grained Scheduler on Yarn
>
> Hi Jun,
>
> Spark currently doesn't have that feature, i.e. it aims for a fixed number
> of executors per application regardless of resource usage, but it's
> definitely worth considering. We could start more executors when we have a
> large backlog of tasks and shut some down when we're underutilized.
>
> The fine-grained task scheduling is blocked on work from YARN that will
> allow changing the CPU allocation of a YARN container dynamically. The
> relevant JIRA for this dependency is YARN-1197, though YARN-1488 might
> serve this purpose as well if it comes first.
>
> -Sandy
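A minimal sketch of that backlog-driven sizing idea, assuming a hook that
can see the pending-task count and pick a new executor total; the names
here (ClusterState, desiredExecutors) are hypothetical, not an existing
SchedulerBackend API:

```scala
// Hypothetical policy: grow the executor count when the task backlog is large,
// shrink it when executors sit idle. Thresholds are illustrative only.
case class ClusterState(backlog: Int, totalExecutors: Int, busyExecutors: Int)

object ScalingPolicy {
  val tasksPerExecutor = 4   // assumed task slots per executor
  val minExecutors     = 2
  val maxExecutors     = 50

  /** Executor count to request from the cluster manager for the current state. */
  def desiredExecutors(s: ClusterState): Int = {
    val neededForBacklog = math.ceil(s.backlog.toDouble / tasksPerExecutor).toInt
    val target =
      if (s.backlog > 0) s.totalExecutors.max(neededForBacklog) // scale up for a backlog
      else s.busyExecutors                                      // release idle executors
    target.max(minExecutors).min(maxExecutors)
  }
}

// Example: 40 queued tasks on 5 fully busy executors -> request 10 executors.
// ScalingPolicy.desiredExecutors(ClusterState(backlog = 40, totalExecutors = 5, busyExecutors = 5))
```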
>
> On Thu, Aug 7, 2014 at 10:56 PM, Jun Feng Liu <liujunf@cn.ibm.com> wrote:
>
> > Thanks for the echo on this. Is it possible to adjust resources based
> > on container numbers? E.g. allocate more containers when the driver
> > needs more resources, and return some resources by deleting containers
> > when parts of the application already have enough cores/memory.
> >
> > Best Regards
> >
> > Jun Feng Liu
> > IBM China Systems & Technology Laboratory in Beijing
> >
> >
> > Patrick Wendell <pwendell@gmail.com>
> > 2014/08/08 13:10
> > To: Jun Feng Liu/China/IBM@IBMCN
> > Cc: "dev@spark.apache.org" <dev@spark.apache.org>
> > Subject: Re: Fine-Grained Scheduler on Yarn
> >
> > Hey, sorry about that - what I said was the opposite of what is true.
> >
> > The current YARN mode is equivalent to "coarse grained" Mesos. There is
> > no fine-grained scheduling on YARN at the moment. I'm not sure YARN
> > supports scheduling in units other than containers. Fine-grained
> > scheduling requires scheduling at the granularity of individual cores.
> >
> >
> > On Thu, Aug 7, 2014 at 9:43 PM, Patrick Wendell <pwendell@gmail.com>
> > wrote:
> > The current YARN mode is equivalent to what is called "fine grained"
> > mode in Mesos. The scheduling of tasks happens totally inside of the
> > Spark driver.
> >
> >
> > On Thu, Aug 7, 2014 at 7:50 PM, Jun Feng Liu <liujunf@cn.ibm.com>
> > wrote:
> > Anyone know the answer?
> >
> > Best Regards
> >
> > Jun Feng Liu
> > IBM China Systems & Technology Laboratory in Beijing
> >
> >
> > Jun Feng Liu/China/IBM
> > 2014/08/07 15:37
> > To: dev@spark.apache.org
> > Subject: Fine-Grained Scheduler on Yarn
> >
> > Hi there,
> >
> > I just became aware that right now Spark only supports a fine-grained
> > scheduler on Mesos, with MesosSchedulerBackend. The YARN scheduler
> > sounds like it only works on a coarse-grained model. Is there any plan
> > to implement a fine-grained scheduler for YARN? Or is there any
> > technical issue blocking us from doing that?
> >
> > Best Regards
> >
> > Jun Feng Liu
> > IBM China Systems & Technology Laboratory in Beijing
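For reference on the Mesos distinction raised in the original question: the
fine-grained versus coarse-grained choice on Mesos is made through
configuration rather than a separate entry point. A minimal sketch,
assuming a Spark 1.x application (the Mesos master URL is a placeholder):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Sketch of selecting the Mesos scheduling mode (Spark 1.x). Left at its
// default of "false", spark.mesos.coarse uses the fine-grained
// MesosSchedulerBackend, where each Spark task runs as a Mesos task and cores
// are acquired per task. Setting it to "true" switches to coarse-grained mode
// with long-running executors, closer to how Spark currently runs on YARN.
val conf = new SparkConf()
  .setAppName("mesos-mode-example")
  .setMaster("mesos://mesos-master:5050") // placeholder Mesos master URL
  .set("spark.mesos.coarse", "true")      // "false" (the default) = fine-grained

val sc = new SparkContext(conf)
```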