Mailing-List: contact user-help@flink.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@flink.apache.org
MIME-Version: 1.0
In-Reply-To: <B494BA64-ACF5-4E44-AD34-E3F5DB8C8AB6@inria.fr>
References: <A3F48AF2-A6ED-4F4A-BFD3-D294500AEADA@inria.fr>
 <CAGr9p8DjOBgKqjqbbOA=aVYOMovrZNaELG=fncTZ8S73H1RGKg@mail.gmail.com>
 <B494BA64-ACF5-4E44-AD34-E3F5DB8C8AB6@inria.fr>
From: Robert Metzger <rmetzger@apache.org>
Date: Fri, 20 Nov 2015 13:34:03 +0100
Message-ID: 
 <CAGr9p8A3JhJGqXsy8X+7xWr4u=X07_Lh_6bh5xqM9BxkRt_wSg@mail.gmail.com>
Subject: Re: Apache Flink on Hadoop YARN using a YARN Session
To: "user@flink.apache.org" <user@flink.apache.org>
Content-Type: multipart/alternative; boundary=001a114037445b07030524f818d6

--001a114037445b07030524f818d6
Content-Type: text/plain; charset=UTF-8

Hi,
I'll fix the link in the YARN documentation. Thank you for reporting the
issue.

I'm not aware of any discussions or implementations related to scheduling.
From my experience working with users and also from the mailing list, I
don't think that such features are very important.
Since streaming jobs usually run permanently, there is no need to queue
them.
For batch jobs, YARN takes care of the resource allocation (in practice
this means that the job has to wait until the required resources are
available).
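
To make the "no queuing" behaviour from my earlier mail (quoted below)
concrete, here is a minimal sketch in Java. It is a toy model, not Flink
code: the class SessionSlots, its methods, and the idea of counting
resources as a fixed number of slots are assumptions made only for
illustration. A program is accepted only if enough slots are free at
submission time; a rejected program is not queued and has to be resubmitted.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model only: SessionSlots and its methods are hypothetical names,
// not part of the Flink API.
public class SessionSlots {
    private final int totalSlots;
    // program name -> number of slots it currently occupies
    private final Map<String, Integer> running = new HashMap<>();

    public SessionSlots(int totalSlots) {
        this.totalSlots = totalSlots;
    }

    public int freeSlots() {
        int used = 0;
        for (int slots : running.values()) {
            used += slots;
        }
        return totalSlots - used;
    }

    // Accepts the program only if enough slots are free right now.
    // There is no queue: a rejected program is simply dropped.
    public boolean submit(String program, int requiredSlots) {
        if (requiredSlots <= 0
                || running.containsKey(program)
                || requiredSlots > freeSlots()) {
            return false;
        }
        running.put(program, requiredSlots);
        return true;
    }

    // Releases the slots held by a finished program.
    public void finish(String program) {
        running.remove(program);
    }

    public static void main(String[] args) {
        SessionSlots session = new SessionSlots(4);
        System.out.println(session.submit("A", 3)); // true: 4 slots free
        System.out.println(session.submit("B", 2)); // false: only 1 slot free
        System.out.println(session.submit("C", 1)); // true: runs next to A
        session.finish("A");
        System.out.println(session.submit("B", 2)); // true after resubmitting
    }
}
```

In this sketch A and C run concurrently, while B is rejected rather than
held back until A finishes; the caller has to submit it again.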

There are some discussions (and user requests) regarding resource
elasticity going on and I think we'll add features for dynamically changing
the size of a Flink cluster on YARN while a job is running.

Which scheduling features are you missing in Flink? Please let me
know if there is anything blocking you from using Flink in production and
we'll see what we can do.

Regards,
Robert


On Fri, Nov 20, 2015 at 1:24 PM, Ovidiu-Cristian MARCU <
ovidiu-cristian.marcu@inria.fr> wrote:

> Hi,
>
> The link to the FAQ (
> https://ci.apache.org/projects/flink/flink-docs-release-0.10/faq.html) is
> on the YARN setup page of the 0.10 documentation (
> https://ci.apache.org/projects/flink/flink-docs-release-0.10/setup/yarn_setup.html)
> in this sentence: *If you have troubles using the Flink YARN
> client, have a look in the FAQ section
> <https://ci.apache.org/projects/flink/flink-docs-release-0.10/faq.html>.*
>
> Are scheduling features being considered for upcoming releases?
>
> Thank you.
> Best regards,
> Ovidiu
>
> On 20 Nov 2015, at 11:59, Robert Metzger <rmetzger@apache.org> wrote:
>
> Hi Ovidiu,
>
> you can submit multiple programs to a running Flink cluster (or a YARN
> session). Flink currently does not have any queuing mechanism.
> The JobManager will reject a program if there are not enough free
> resources for it. If there are enough resources for multiple programs,
> they'll run concurrently.
> Note that Flink does not start separate JVMs for the programs, so if one
> program calls System.exit(0), it kills the entire JVM, including the
> other running programs.
>
> You can start as many YARN sessions (or submit as many single jobs to YARN)
> as you have resources available on the cluster. The resource allocation is
> up to the scheduler you've configured in YARN.
>
> In general, we recommend starting one YARN session per program. You can also
> directly submit a Flink program to YARN.
>
> Where did you find the link to the FAQ? The link on the front page is
> working: http://flink.apache.org/faq.html
>
>
>
> On Fri, Nov 20, 2015 at 11:41 AM, Ovidiu-Cristian MARCU <
> ovidiu-cristian.marcu@inria.fr> wrote:
>
>> Hi,
>>
>> I am currently interested in experimenting with Flink on Hadoop YARN.
>> I am working from the documentation here:
>> https://ci.apache.org/projects/flink/flink-docs-release-0.10/setup/yarn_setup.html
>>
>> There is a subsection *Start Flink Session* which states the following: *A
>> session will start all required Flink services (JobManager and
>> TaskManagers) so that you can submit programs to the cluster. Note that you
>> can run multiple programs per session.*
>>
>> Can you be more precise about running multiple programs per session? Can I
>> submit multiple programs concurrently, and if so, what will happen? Will
>> they run in FIFO order, or what should I expect?
>>
>> The internals section specifies that users can execute multiple Flink YARN
>> sessions in parallel. This is great; it invites static partitioning of
>> resources in order to run multiple applications concurrently. Do you
>> support a fair scheduler similar to what Spark claims it has?
>>
>> The FAQ section (
>> https://ci.apache.org/projects/flink/flink-docs-release-0.10/faq.html)
>> is missing. Can this be updated?
>>
>> Thank you.
>>
>> Best regards,
>> Ovidiu
>>
>>
>
>
>

--001a114037445b07030524f818d6--