Mailing-List: contact user-help@flink.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@flink.apache.org
Date: Wed, 8 Mar 2017 18:21:19 -0800 (PST)
From: vinay patil <vinay18.patil@gmail.com>
To: user@flink.apache.org
Message-ID: <CAMpYU5S_0yGiDBDCqX8VfAr=c28vVzziBWY4Kq0X0r3BkZYRQg@mail.gmail.com>
In-Reply-To: <CAK18CWq8wM3-5RFxZcFnoC_Em7EaMYJEk6ae_MU9QDvh8v2wZQ@mail.gmail.com>
References: <CAMpYU5Rh2_iJxbBn135SaKDM8wafdNoB-j4pS3YO8RemF9qMDA@mail.gmail.com> <2778A8A8-9432-4881-98EB-A17426CCC031@data-artisans.com> <CAMpYU5SD9pANqJKyFmZ_-2VbS23ZeR01q6T_zLWW57msn9dBQQ@mail.gmail.com> <8552427B-F100-411C-8B15-EBD677A58CE4@data-artisans.com> <CAK18CWq8wM3-5RFxZcFnoC_Em7EaMYJEk6ae_MU9QDvh8v2wZQ@mail.gmail.com>
Subject: Re: Frequent Full GC's in case of FSStateBackend
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_Part_1848_1566169272.1489026079533"
archived-at: Thu, 09 Mar 2017 02:27:35 -0000

------=_Part_1848_1566169272.1489026079533
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

Hi Sai,

If you are sure that your state will not exceed the memory limit of nodes
then you should consider FSStatebackend otherwise you should go for RocksDB

What is the configuration of your cluster ?

On Mar 9, 2017 7:31 AM, "saiprasad mishra [via Apache Flink User Mailing
List archive.]" <ml-node+s2336050n12126h58@n4.nabble.com> wrote:

> Hi All
>
> I am also seeing issues with FsStateBackend as it stalls coz of full gc.
> We have very large state,
> Does this mean the below doc should not claim that FsStateBackend is
> encouraged for large state.
>
> https://ci.apache.org/projects/flink/flink-docs-release-1.2/ops/state_
> backends.html#the-fsstatebackend
>
> Regards
> Sai
>
> On Fri, Feb 10, 2017 at 6:19 AM, Stefan Richter <[hidden email]
> <http:///user/SendEmail.jtp?type=node&node=12126&i=0>> wrote:
>
>> Async snapshotting is the default.
>>
>> Am 10.02.2017 um 14:03 schrieb vinay patil <[hidden email]
>> <http:///user/SendEmail.jtp?type=node&node=12126&i=1>>:
>>
>> Hi Stephan,
>>
>> Thank you for the clarification.
>> Yes with RocksDB I don't see Full GC happening, also I am using Flink
>> 1.2.0 version and I have set the statebackend in flink-conf.yaml file to
>> rocksdb, so by default does this do asynchronous checkpointing or I have to
>> specify it at the job level  ?
>>
>> Regards,
>> Vinay Patil
>>
>> On Fri, Feb 10, 2017 at 4:16 PM, Stefan Richter [via Apache Flink User
>> Mailing List archive.] <[hidden email]> wrote:
>>
>>> Hi,
>>>
>>> FSStateBackend operates completely on-heap and only snapshots for
>>> checkpoints go against the file system. This is why the backend is
>>> typically faster for small states, but can become problematic for larger
>>> states. If your state exceeds a certain size, you should strongly consider
>>> to use RocksDB as backend. In particular, RocksDB also offers asynchronous
>>> snapshots which is very valuable to keep stream processing running for
>>> large state. RocksDB works on native memory/disk, so there is no GC to
>>> observe. For cases in which your state fits in memory but GC is a problem
>>> you could try using the G1 garbage collector which offers better
>>> performance for the FSStateBackend than the default.
>>>
>>> Best,
>>> Stefan
>>>
>>>
>>> Am 10.02.2017 um 11:16 schrieb Vinay Patil <[hidden email]
>>> <http://user/SendEmail.jtp?type=node&node=11565&i=0>>:
>>>
>>> Hi,
>>>
>>> I am doing performance test for my pipeline keeping FSStateBackend, I
>>> have observed frequent Full GC's after processing 20M records.
>>>
>>> When I did memory analysis using MAT, it showed that the many objects
>>> maintained by Flink state are live.
>>>
>>> Flink keeps the state in memory even after checkpointing , when does
>>> this state gets removed / GC. (I am using window operator in which the DTO
>>> comes as input)
>>>
>>> Also why does Flink keep the state in memory after checkpointing ?
>>>
>>> P.S Using RocksDB is not causing Full GC at all.
>>>
>>> Regards,
>>> Vinay Patil
>>>
>>>
>>>
>>>
>>> ------------------------------
>>> If you reply to this email, your message will be added to the discussion
>>> below:
>>> http://apache-flink-user-mailing-list-archive.2336050.n4.nab
>>> ble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p11565.html
>>> To start a new topic under Apache Flink User Mailing List archive.,
>>> email [hidden email]
>>> To unsubscribe from Apache Flink User Mailing List archive., click here.
>>> NAML
>>> <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml>
>>>
>>
>>
>> ------------------------------
>> View this message in context: Re: Frequent Full GC's in case of
>> FSStateBackend
>> <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p11568.html>
>> Sent from the Apache Flink User Mailing List archive. mailing list
>> archive
>> <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/>
>> at Nabble.com.
>>
>>
>>
>
>
> ------------------------------
> If you reply to this email, your message will be added to the discussion
> below:
> http://apache-flink-user-mailing-list-archive.2336050.
> n4.nabble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p12126.
> html
> To start a new topic under Apache Flink User Mailing List archive., email
> ml-node+s2336050n1h83@n4.nabble.com
> To unsubscribe from Apache Flink User Mailing List archive., click here
> <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=1&code=dmluYXkxOC5wYXRpbEBnbWFpbC5jb218MXwxODExMDE2NjAx>
> .
> NAML
> <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml>
>


--
View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p12127.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.
------=_Part_1848_1566169272.1489026079533
Content-Type: text/html; charset=UTF8
Content-Transfer-Encoding: quoted-printable

<p dir=3D"ltr">Hi Sai,</p>
<p dir=3D"ltr">If you are sure that your state will not exceed the memory l=
imit of nodes then you should consider FSStatebackend otherwise you should =
go for RocksDB</p>
<p dir=3D"ltr">What is the configuration of your cluster ? </p>
<div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Mar 9, 2017 7:=
31 AM, &quot;saiprasad mishra [via Apache Flink User Mailing List archive.]=
&quot; &lt;<a href=3D"/user/SendEmail.jtp?type=3Dnode&node=3D12127&i=3D0" t=
arget=3D"_top" rel=3D"nofollow" link=3D"external">[hidden email]</a>&gt; wr=
ote:<br type=3D"attribution"><blockquote style=3D'border-left:2px solid #CC=
CCCC;padding:0 1em' class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border=
-left:1px #ccc solid;padding-left:1ex">

=09<div dir=3D"ltr">Hi All<div><br></div><div>I am also seeing issues with=
=C2=A0FsStateBackend as it stalls coz of full gc. We have very large state,=
</div><div>Does this mean the below doc should not claim that FsStateBacken=
d is encouraged for large state.</div><div><br></div><div><a href=3D"https:=
//ci.apache.org/projects/flink/flink-docs-release-1.2/ops/state_backends.ht=
ml#the-fsstatebackend" rel=3D"nofollow" link=3D"external" target=3D"_blank"=
>https://ci.apache.org/<wbr>projects/flink/flink-docs-<wbr>release-1.2/ops/=
state_<wbr>backends.html#the-<wbr>fsstatebackend</a><br></div><div><br></di=
v><div>Regards</div><div>Sai</div></div><div class=3D"gmail_extra"><br><div=
 class=3D"gmail_quote">On Fri, Feb 10, 2017 at 6:19 AM, Stefan Richter <spa=
n dir=3D"ltr">&lt;<a href=3D"http:///user/SendEmail.jtp?type=3Dnode&amp;nod=
e=3D12126&amp;i=3D0" rel=3D"nofollow" link=3D"external" target=3D"_blank">[=
hidden email]</a>&gt;</span> wrote:<br><blockquote style=3D'border-left:2px=
 solid #CCCCCC;padding:0 1em' style=3D"border-left:2px solid #cccccc;paddin=
g:0 1em" class=3D"gmail_quote"><div style=3D"word-wrap:break-word">Async sn=
apshotting is the default.=C2=A0<div><div class=3D"m_-7992874507661071908h5=
"><div><br><div><blockquote style=3D'border-left:2px solid #CCCCCC;padding:=
0 1em' style=3D"border-left:2px solid #cccccc;padding:0 1em" type=3D"cite">=
<div>Am 10.02.2017 um 14:03 schrieb vinay patil &lt;<a href=3D"http:///user=
/SendEmail.jtp?type=3Dnode&amp;node=3D12126&amp;i=3D1" rel=3D"nofollow" lin=
k=3D"external" target=3D"_blank">[hidden email]</a>&gt;:</div><br class=3D"=
m_-7992874507661071908m_-2937722640849736572Apple-interchange-newline"><div=
><div dir=3D"ltr">Hi Stephan,<br><br>Thank you for the clarification.<br>Ye=
s with RocksDB I don&#39;t see Full GC happening, also I am using Flink 1.2=
.0 version and I have set the statebackend in flink-conf.yaml file to rocks=
db, so by default does this do asynchronous checkpointing or I have to spec=
ify it at the job level =C2=A0?</div><div class=3D"gmail_extra"><br clear=
=3D"all"><div><div class=3D"m_-7992874507661071908m_-2937722640849736572gma=
il_signature" data-smartmail=3D"gmail_signature"><div dir=3D"ltr"><div><div=
 dir=3D"ltr"><font>Regards,</font><div><font>Vinay Patil</font></div></div>=
</div></div></div></div>
<br><div class=3D"gmail_quote">On Fri, Feb 10, 2017 at 4:16 PM, Stefan Rich=
ter [via Apache Flink User Mailing List archive.] <span dir=3D"ltr">&lt;<a =
rel=3D"nofollow" link=3D"external" target=3D"_top">[hidden email]</a>&gt;</=
span> wrote:<br><blockquote style=3D'border-left:2px solid #CCCCCC;padding:=
0 1em' style=3D"border-left:2px solid #cccccc;padding:0 1em" class=3D"gmail=
_quote">

=09<div>Hi,</div><div><br></div><div>FSStateBackend operates completely on-=
heap and only snapshots for checkpoints go against the file system. This is=
 why the backend is typically faster for small states, but can become probl=
ematic for larger states. If your state exceeds a certain size, you should =
strongly consider to use RocksDB as backend. In particular, RocksDB also of=
fers asynchronous snapshots which is very valuable to keep stream processin=
g running for large state. RocksDB works on native memory/disk, so there is=
 no GC to observe. For cases in which your state fits in memory but GC is a=
 problem you could try using the G1 garbage collector which offers better p=
erformance for the FSStateBackend than the default.</div><div><br></div><di=
v>Best,</div><div>Stefan</div><div><div class=3D"m_-7992874507661071908m_-2=
937722640849736572h5"><div><br></div><br><div><blockquote style=3D'border-l=
eft:2px solid #CCCCCC;padding:0 1em' style=3D"border-left:2px solid #cccccc=
;padding:0 1em" type=3D"cite"><div>Am 10.02.2017 um 11:16 schrieb Vinay Pat=
il &lt;<a href=3D"http://user/SendEmail.jtp?type=3Dnode&amp;node=3D11565&am=
p;i=3D0" rel=3D"nofollow" link=3D"external" target=3D"_blank">[hidden email=
]</a>&gt;:</div><br class=3D"m_-7992874507661071908m_-2937722640849736572m_=
5797439510880024310Apple-interchange-newline"><div><div dir=3D"ltr">Hi,<br>=
<br>I am doing performance test for my pipeline keeping FSStateBackend, I h=
ave observed frequent Full GC&#39;s after processing 20M records.<br><br>Wh=
en I did memory analysis using MAT, it showed that the many objects maintai=
ned by Flink state are live.<br><br>Flink keeps the state in memory even af=
ter checkpointing , when does this state gets removed / GC. (I am using win=
dow operator in which the DTO comes as input)<div><br></div><div>Also why d=
oes Flink keep the state in memory after checkpointing ?=C2=A0<br><br>P.S U=
sing RocksDB is not causing Full GC at all.<br><br clear=3D"all"><div><div =
class=3D"m_-7992874507661071908m_-2937722640849736572m_5797439510880024310g=
mail_signature" data-smartmail=3D"gmail_signature"><div dir=3D"ltr"><div><d=
iv dir=3D"ltr"><font>Regards,</font><div><font>Vinay Patil</font></div></di=
v></div></div></div></div>
</div></div>
</div></blockquote></div><br>

=09
=09
=09
=09<br>
=09<br>
=09</div></div><hr noshade size=3D"1">
=09<div style=3D"color:#444;font:12px tahoma,geneva,helvetica,arial,sans-se=
rif">
=09=09<div style=3D"font-weight:bold">If you reply to this email, your mess=
age will be added to the discussion below:</div>
=09=09<a href=3D"http://apache-flink-user-mailing-list-archive.2336050.n4.n=
abble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p11565.html" =
rel=3D"nofollow" link=3D"external" target=3D"_blank">http://apache-flink-us=
er-maili<wbr>ng-list-archive.2336050.n4.nab<wbr>ble.com/Frequent-Full-GC-s-=
in-<wbr>case-of-FSStateBackend-tp11564<wbr>p11565.html</a>
=09</div>
=09<div style=3D"color:#666;font:11px tahoma,geneva,helvetica,arial,sans-se=
rif;margin-top:.4em;line-height:1.5em">
=09=09To start a new topic under Apache Flink User Mailing List archive., e=
mail <a rel=3D"nofollow" link=3D"external" target=3D"_top">[hidden email]</=
a> <br>
=09=09To unsubscribe from Apache Flink User Mailing List archive., <a rel=
=3D"nofollow" link=3D"external" target=3D"_top">click here</a>.<br>
=09=09<a href=3D"http://apache-flink-user-mailing-list-archive.2336050.n4.n=
abble.com/template/NamlServlet.jtp?macro=3Dmacro_viewer&amp;id=3Dinstant_ht=
ml%21nabble%3Aemail.naml&amp;base=3Dnabble.naml.namespaces.BasicNamespace-n=
abble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamesp=
ace&amp;breadcrumbs=3Dnotify_subscribers%21nabble%3Aemail.naml-instant_emai=
ls%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml" rel=3D"n=
ofollow" style=3D"font:9px serif" link=3D"external" target=3D"_blank">NAML<=
/a>
=09</div></blockquote></div><br></div>


=09
=09
=09
<br><hr align=3D"left" width=3D"300">
View this message in context: <a href=3D"http://apache-flink-user-mailing-l=
ist-archive.2336050.n4.nabble.com/Frequent-Full-GC-s-in-case-of-FSStateBack=
end-tp11564p11568.html" rel=3D"nofollow" link=3D"external" target=3D"_blank=
">Re: Frequent Full GC&#39;s in case of FSStateBackend</a><br>
Sent from the <a href=3D"http://apache-flink-user-mailing-list-archive.2336=
050.n4.nabble.com/" rel=3D"nofollow" link=3D"external" target=3D"_blank">Ap=
ache Flink User Mailing List archive. mailing list archive</a> at <a href=
=3D"http://Nabble.com" rel=3D"nofollow" link=3D"external" target=3D"_blank"=
>Nabble.com</a>.<br></div></blockquote></div><br></div></div></div></div></=
blockquote></div><br></div>


=09
=09
=09
=09<br>
=09<br>
=09<hr noshade size=3D"1" color=3D"#cccccc">
=09<div style=3D"color:#444;font:12px tahoma,geneva,helvetica,arial,sans-se=
rif">
=09=09<div style=3D"font-weight:bold">If you reply to this email, your mess=
age will be added to the discussion below:</div>
=09=09<a href=3D"http://apache-flink-user-mailing-list-archive.2336050.n4.n=
abble.com/Frequent-Full-GC-s-in-case-of-FSStateBackend-tp11564p12126.html" =
target=3D"_blank" rel=3D"nofollow" link=3D"external">http://apache-flink-us=
er-<wbr>mailing-list-archive.2336050.<wbr>n4.nabble.com/Frequent-Full-<wbr>=
GC-s-in-case-of-<wbr>FSStateBackend-tp11564p12126.<wbr>html</a>
=09</div>
=09<div style=3D"color:#666;font:11px tahoma,geneva,helvetica,arial,sans-se=
rif;margin-top:.4em;line-height:1.5em">
=09=09To start a new topic under Apache Flink User Mailing List archive., e=
mail <a href=3D"/user/SendEmail.jtp?type=3Dnode&node=3D12127&i=3D1" target=
=3D"_top" rel=3D"nofollow" link=3D"external">[hidden email]</a> <br>
=09=09To unsubscribe from Apache Flink User Mailing List archive., <a href=
=3D"" target=3D"_blank" rel=3D"nofollow" link=3D"external">click here</a>.<=
br>
=09=09<a href=3D"http://apache-flink-user-mailing-list-archive.2336050.n4.n=
abble.com/template/NamlServlet.jtp?macro=3Dmacro_viewer&amp;id=3Dinstant_ht=
ml%21nabble%3Aemail.naml&amp;base=3Dnabble.naml.namespaces.BasicNamespace-n=
abble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamesp=
ace&amp;breadcrumbs=3Dnotify_subscribers%21nabble%3Aemail.naml-instant_emai=
ls%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml" rel=3D"n=
ofollow" style=3D"font:9px serif" target=3D"_blank" link=3D"external">NAML<=
/a>
=09</div></blockquote></div></div>


=09
=09
=09
<br/><hr align=3D"left" width=3D"300" />
View this message in context: <a href=3D"http://apache-flink-user-mailing-l=
ist-archive.2336050.n4.nabble.com/Frequent-Full-GC-s-in-case-of-FSStateBack=
end-tp11564p12127.html">Re: Frequent Full GC's in case of FSStateBackend</a=
><br/>
Sent from the <a href=3D"http://apache-flink-user-mailing-list-archive.2336=
050.n4.nabble.com/">Apache Flink User Mailing List archive. mailing list ar=
chive</a> at Nabble.com.<br/>
------=_Part_1848_1566169272.1489026079533--