Mailing-List: contact user-help@flink.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@flink.apache.org
MIME-Version: 1.0
In-Reply-To: 
 <CABGNe=YSKwMAEZyRYCKDBcWKp2BOWfxOVA-BUp=RDkksm5AV8w@mail.gmail.com>
References: 
 <CABGNe=aG+hQLesNtocQ0FMdn42A=b+QiRaV33k3q6KgwmM7CoA@mail.gmail.com>
	<CAAdrtT19vZ-c3KGr0kbgLiwYj3bh1XmAGeEFdBNwmgv-Omj1Cw@mail.gmail.com>
	<CABGNe=aG1CutV-ZWEjAB2BKx=4faH7UdPW1W23jz7XDwTZ1PGg@mail.gmail.com>
	<CAKiyyaE+5K1+j3rymMkCMK-wVVpxuwYr1frcE5Hj_edEZS554Q@mail.gmail.com>
	<CABGNe=YSKwMAEZyRYCKDBcWKp2BOWfxOVA-BUp=RDkksm5AV8w@mail.gmail.com>
Date: Tue, 23 Feb 2016 17:57:21 +0700
Message-ID: 
 <CABGNe=ZOi_3ytNSopiRDnHZ6KAB=RFF+Yt_gF_Y1Zb7TtV0K-w@mail.gmail.com>
Subject: Re: Optimal Configuration for Cluster
From: Welly Tambunan <if05041@gmail.com>
To: user@flink.apache.org
Content-Type: multipart/alternative; boundary=001a114157c6501ef8052c6dd051

--001a114157c6501ef8052c6dd051
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Hi Ufuk and Fabian,

Is it better to start 48 task managers (one slot each) on one machine
than a single task manager with 48 slots? Are there any trade-offs that
we should know about?
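
For reference, the buffer formula quoted further down in this thread
(p ^ 2 * t * 4) can be evaluated with a few lines of Python. This is only a
rough sketch: the function name is made up for illustration, and plugging in
48 for p and 16 for t follows the substitution used in the original question
(# core for p, # machine for t), which assumes one task manager per machine.

```python
def network_buffers(p: int, t: int) -> int:
    """Evaluate the formula quoted in this thread: p ^ 2 * t * 4.

    p: maximum parallelism of the job
    t: number of task managers
    """
    if p < 1 or t < 1:
        raise ValueError("p and t must be positive")
    return p * p * t * 4


# Numbers from the original question: 48 cores, 16 machines.
# Assumption for illustration: one task manager per machine, so t = 16.
buffers = network_buffers(p=48, t=16)
print(f"taskmanager.network.numberOfBuffers: {buffers}")
```

How p and t should be chosen for the 48-task-manager layout versus the
48-slot layout is exactly the open question, so treat the inputs above as
placeholders.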

Cheers

On Tue, Feb 23, 2016 at 3:03 PM, Welly Tambunan <if05041@gmail.com> wrote:

> Hi Ufuk,
>
> Thanks for the explanation.
>
> Yes. Our jobs are all streaming jobs.
>
> Cheers
>
> On Tue, Feb 23, 2016 at 2:48 PM, Ufuk Celebi <uce@apache.org> wrote:
>
>> The new default is equivalent to the previous "streaming mode". The
>> community decided to get rid of this distinction, because it was
>> confusing to users.
>>
>> The difference between "streaming mode" and "batch mode" was how
>> Flink's managed memory was allocated, either lazily when required
>> ("streaming mode") or eagerly on task manager start-up ("batch mode").
>> Now it's lazy by default.
>>
>> This is not something you need to worry about, but if you are mostly
>> using the DataSet API, where preallocation has benefits, you can get
>> the "batch mode" behaviour by using the following configuration key:
>>
>> taskmanager.memory.preallocate: true
>>
>> But you are using the DataStream API anyways, right?
>>
>> – Ufuk
>>
>>
>> On Tue, Feb 23, 2016 at 6:36 AM, Welly Tambunan <if05041@gmail.com>
>> wrote:
>> > Hi Fabian,
>> >
>> > Previously, when using Flink 0.9-0.10, we started the cluster in
>> > streaming mode or batch mode. I see that this option is gone in the
>> > Flink 1.0 snapshot. So is this now taken care of by Flink and
>> > optimized by the runtime?
>> >
>> > On Mon, Feb 22, 2016 at 5:26 PM, Fabian Hueske <fhueske@gmail.com> wrote:
>> >>
>> >> Hi Welly,
>> >>
>> >> sorry for the late response.
>> >>
>> >> The number of network buffers primarily depends on the maximum
>> >> parallelism of your job.
>> >> The given formula assumes a specific cluster configuration (1 task
>> >> manager per machine, one parallel task per CPU).
>> >> The formula can be translated to:
>> >>
>> >> taskmanager.network.numberOfBuffers: p ^ 2 * t * 4
>> >>
>> >> where p is the maximum parallelism of the job and t is the number of
>> >> task managers.
>> >> You can process more than one parallel task per TM if you configure
>> >> more than one processing slot per machine
>> >> (taskmanager.numberOfTaskSlots). The TM will divide its memory among
>> >> all its slots. So it would be possible to start one TM for each
>> >> machine with 100GB+ memory and 48 slots each.
>> >>
>> >> We can compute the number of network buffers if you give a few more
>> >> details about your setup:
>> >> - How many task managers do you start? I assume more than one TM per
>> >> machine given that you assign only 4GB of memory out of 128GB to each
>> >> TM.
>> >> - What is the maximum parallelism of your program?
>> >> - How many processing slots do you configure for each TM?
>> >>
>> >> In general, pipelined shuffles with a high parallelism require a lot
>> >> of memory.
>> >> If you configure batch instead of pipelined transfer, the memory
>> >> requirement goes down
>> >> (ExecutionConfig.setExecutionMode(ExecutionMode.BATCH)).
>> >>
>> >> Eventually, we want to merge the network buffer and the managed
>> >> memory pools. So the "taskmanager.network.numberOfBuffers"
>> >> configuration will hopefully disappear at some point in the future.
>> >>
>> >> Best, Fabian
>> >>
>> >> 2016-02-19 9:34 GMT+01:00 Welly Tambunan <if05041@gmail.com>:
>> >>>
>> >>> Hi All,
>> >>>
>> >>> We are trying to run our job on a cluster with the following
>> >>> specification:
>> >>>
>> >>> 1. # of machines: 16
>> >>> 2. memory: 128 GB
>> >>> 3. # of cores: 48
>> >>>
>> >>> However, when we try to run it, we get an exception:
>> >>>
>> >>> "insufficient number of network buffers. 48 required but only 10
>> >>> available. the total number of network buffers is currently set to
>> >>> 2048"
>> >>>
>> >>> After looking at the documentation, we set the configuration based on
>> >>> the docs:
>> >>>
>> >>> taskmanager.network.numberOfBuffers: # core ^ 2 * # machine * 4
>> >>>
>> >>> However, we then hit another error from the JVM:
>> >>>
>> >>> java.io.IOException: Cannot allocate network buffer pool: Could not
>> >>> allocate enough memory segments for NetworkBufferPool (required (Mb):
>> >>> 2304, allocated (Mb): 698, missing (Mb): 1606). Cause: Java heap space
>> >>>
>> >>> We then set taskmanager.heap.mb: 4096
>> >>>
>> >>> Finally, the cluster is running.
>> >>>
>> >>> However, I'm still not sure about the configuration, or whether
>> >>> fiddling with the task manager heap really counts as fine-tuning. So
>> >>> my questions are:
>> >>>
>> >>> Am I doing it right for numberOfBuffers?
>> >>> How much should we allocate for taskmanager.heap.mb given the
>> >>> information above?
>> >>> Any suggestions on which configuration we need to set to make it
>> >>> optimal for the cluster?
>> >>> Is there any chance that this will get resolved automatically by the
>> >>> memory/network buffer manager?
>> >>>
>> >>> Thanks a lot for the help
>> >>>
>> >>> Cheers
>> >>>
>> >>> --
>> >>> Welly Tambunan
>> >>> Triplelands
>> >>>
>> >>> http://weltam.wordpress.com
>> >>> http://www.triplelands.com
>> >>
>> >>
>> >
>> >
>> >
>> > --
>> > Welly Tambunan
>> > Triplelands
>> >
>> > http://weltam.wordpress.com
>> > http://www.triplelands.com
>>
>
>
>
> --
> Welly Tambunan
> Triplelands
>
> http://weltam.wordpress.com
> http://www.triplelands.com <http://www.triplelands.com/blog/>
>


-- 
Welly Tambunan
Triplelands

http://weltam.wordpress.com
http://www.triplelands.com <http://www.triplelands.com/blog/>

--001a114157c6501ef8052c6dd051--