flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jamie Grier <ja...@data-artisans.com>
Subject Re: Memory Management in Streaming?
Date Sat, 03 Sep 2016 22:55:26 GMT
Hi Shaosu,

Do you have an estimate on the total size of state you are keeping for the
windows?  How many messages/sec, how large a window, message size, etc
would be good details to include.

Also, which state backend are you using?  Have you considered using the
RocksDB state backend.  This backend will spill Flink state to disk if it's
larger than available RAM.  You'll also probably want to use "fully async"
mode for the RocksDBStateBackend.

-Jamie


On Fri, Sep 2, 2016 at 1:45 PM, Shaosu Liu <shu@uber.com> wrote:

> Hi,
>
> I have had issues when I processed large amount of data (large windows
> where I could not do incremental updates), flink slowed down significantly.
> It did help when I increased the amount of memory and used off heap
> allocation. But it only delayed the onset of the probelm without solving
> it.
>
> Could some one give me some hints on how Flink manage window buffer and
> how streaming manages its memory. I see this page on batch api memory
> management and wonder what is the equivalent for streaming?
> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=53741525
>
> --
> Cheers,
> Shaosu
>



-- 

Jamie Grier
data Artisans, Director of Applications Engineering
@jamiegrier <https://twitter.com/jamiegrier>
jamie@data-artisans.com

Mime
View raw message