From: Patrick Brunmayr
Date: Fri, 24 Feb 2017 18:37:59 +0100
Subject: Re: Flink the right tool for the job ? Huge Data window lateness
To: user@flink.apache.org

Hi

Yes it is, and it would be nice to handle this with Flink :)

- *Size of data point*

The size of a data point is basically just a simple case class with two
fields (on a 64-bit OS):

case class MachineData(sensorId: String, eventTime: Long)

*- Last write wins*

We have Cassandra as a data warehouse, but I was hoping I could solve that
issue at the state level rather than at the db level. The reason being that
someone could send me the same events over and over again, and this would
cause the state to blow up until it runs out of memory. Secondly, when doing
aggregations per sensor, results will be wrong due to multiple events with
the same timestamp.

thx

2017-02-24 17:47 GMT+01:00 Robert Metzger:

> Hi,
> sounds like a cool project.
>
> What's the size of one data point?
> If one data point is 2 kb, you'll have 100 800 000 * 2048 bytes = 206
> gigabytes of state. That's something one or two machines (depending on the
> disk throughput) should be able to handle.
>
> If possible, I would recommend you to do an experiment using a prototype
> to see how many machines you need for your workload.
>
> On Fri, Feb 24, 2017 at 5:41 PM, Tzu-Li (Gordon) Tai wrote:
>
>> Hi Patrick,
>>
>> Thanks a lot for the feedback on your use case! At a first glance, I
>> would say that Flink can definitely solve the issues you are evaluating.
>>
>> I'll try to explain them, and point you to some docs / articles that can
>> further explain them in detail:
>>
>> *- Lateness*
>>
>> The 7-day lateness shouldn't be a problem.
We definitely recommend
>> using RocksDB as the state backend for such a use case since, as you
>> mentioned correctly, the state would be kept for a long time.
>> The heavy burst when your locally buffered data on the machines is
>> sent to Kafka once they come back online shouldn't be a problem either;
>> since Flink is a pure data streaming engine, it handles backpressure
>> naturally without any additional mechanisms (I would recommend taking a
>> look at http://data-artisans.com/how-flink-handles-backpressure/).
>>
>> *- Out of Order*
>>
>> That's exactly what event time processing is for :-) As long as the event
>> comes in before the allowed lateness for windows, the event will still
>> fall into its corresponding event time window. So, even with the heavy
>> burst of your late machine data, it will still be aggregated in the
>> correct windows.
>> You can look into event time in Flink in more detail in the event time
>> docs:
>> https://ci.apache.org/projects/flink/flink-docs-release-1.3/dev/event_time.html
>>
>> *- Last write wins*
>>
>> Your operators that do the aggregations simply need to be able to
>> reprocess results if they see an event with the same id come in. Now, if
>> results are sent out of Flink and stored in an external db, and you can
>> design the db writes to be idempotent, then it'll effectively be a
>> "last write wins". It depends mostly on your pipeline and use case.
>>
>> *- Computations per minute*
>>
>> I think you can simply do this by having two separate window operators:
>> one that works on your longer window, and another on a per-minute basis.
>>
>> Hope this helps!
>>
>> - Gordon
>>
>> On February 24, 2017 at 10:49:14 PM, Patrick Brunmayr (jay@kpibench.com)
>> wrote:
>>
>> Hello
>>
>> I've done my first steps with Flink and I am very impressed by its
>> capabilities. Thank you for that :) I want to use it for a project we are
>> currently working on.
After reading some documentation
>> I am not sure whether it's the right tool for the job. We have an IoT
>> application in which we are monitoring machines in production plants. The
>> machines have sensors attached, and they currently send their data to a
>> broker (Kafka, Azure IoT Hub) on a per-minute basis.
>>
>> The following requirements must be fulfilled:
>>
>> - Lateness
>>
>>   We have to allow a lateness of 7 days because machines can have down
>>   time due to network issues, maintenance or something else. If that's
>>   the case, buffering of the data happens locally on the machine, and
>>   once they are online again all the data will be sent to the broker.
>>   This can result in some really heavy bursts.
>>
>> - Out of order
>>
>>   Events come out of order due to these lateness issues.
>>
>> - Last write wins
>>
>>   Machines are not stateful and cannot guarantee exactly-once sending of
>>   their data. It can happen that events are sometimes sent twice. In that
>>   case the last event wins and should override the previous one.
>>   Events are unique by a sensor_id and a timestamp.
>>
>> - Computations per minute
>>
>>   We cannot wait until the window ends and have to do computations on a
>>   per-minute basis, for example aggregating data per sensor and writing
>>   it to a db.
>>
>> My biggest concern in this case is the huge lateness. Keeping data for 7
>> days would result in 10,080 data points for just one sensor! Multiplying
>> that by 10,000 sensors would result in 100,800,000 data points which
>> Flink would have to handle in its state. The number of sensors is
>> constantly growing, and so will the number of data points.
>>
>> So my questions are:
>>
>> - Is Flink the right tool for the job?
>>
>> - Is that lateness an issue?
>>
>> - How can I implement the last write wins?
>>
>> - How to tune Flink to handle that growing load of sensors and data
>>   points?
>>
>> - Hardware requirements, storage and memory size?
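[Editor's note: the figures in the thread can be sanity-checked with a few lines of Scala, assuming one data point per sensor per minute over the 7 days of lateness, and taking Robert's 2 KiB-per-point figure from elsewhere in the thread for the state-size estimate.]

```scala
// Sanity check of the figures in the thread: one data point per sensor
// per minute, 7 days of lateness, 10,000 sensors, and roughly 2 KiB of
// state per data point (Robert's estimate).
val pointsPerSensor = 7 * 24 * 60                      // 10,080 points in 7 days
val sensors         = 10000
val totalPoints     = pointsPerSensor.toLong * sensors // 100,800,000 points
val bytesPerPoint   = 2048L                            // 2 KiB per point
val stateBytes      = totalPoints * bytesPerPoint      // ~206 GB of state
val stateGigabytes  = stateBytes / 1e9
```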
>>
>> I don't want to maintain two code bases for batch and streaming, because
>> the operations are all equal. The only difference is the time range!
>> That's the reason I wanted to do all this with Flink Streaming.
>>
>> Hope you can guide me in the right direction.
>>
>> Thx
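[Editor's note: Gordon's point about event-time windows can be illustrated with a minimal plain-Scala sketch — not the Flink API. Each event is bucketed by the 1-minute window containing its eventTime, so a late event still lands in its original window as long as it arrives within the allowed lateness.]

```scala
// Event-time window assignment sketch (plain Scala, not the Flink API).
case class MachineData(sensorId: String, eventTime: Long)

val windowMillis = 60 * 1000L // 1-minute tumbling windows

// The window an event belongs to is determined by its eventTime alone,
// not by when the event arrives.
def windowStart(eventTime: Long): Long =
  eventTime - (eventTime % windowMillis)

// Count events per (sensorId, windowStart) -- the shape of the per-minute,
// per-sensor aggregation discussed in the thread.
def countPerWindow(events: Seq[MachineData]): Map[(String, Long), Int] =
  events
    .groupBy(e => (e.sensorId, windowStart(e.eventTime)))
    .map { case (key, es) => key -> es.size }

// The third event arrives last but still joins the first window.
val counts = countPerWindow(Seq(
  MachineData("s1", 30000L), // window [0, 60000)
  MachineData("s1", 61000L), // window [60000, 120000)
  MachineData("s1", 59000L)  // arrives late, still window [0, 60000)
))
```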
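[Editor's note: the "last write wins" question can likewise be sketched in plain Scala. The `value` field below is hypothetical — the thread's MachineData only carries sensorId and eventTime. In Flink, the map here would correspond to keyed state (for example a MapState per sensor), cleaned up via timers or state TTL once the 7-day lateness has passed.]

```scala
// Last-write-wins deduplication sketch (plain Scala, not Flink API code).
// `value` is a hypothetical payload field added for illustration.
case class MachineData(sensorId: String, eventTime: Long, value: Double)

// Key events by (sensorId, eventTime): a duplicate key overwrites the
// previous entry, so the last event wins and state stays bounded at one
// entry per unique (sensor, timestamp) pair.
def lastWriteWins(events: Seq[MachineData]): Map[(String, Long), MachineData] =
  events.foldLeft(Map.empty[(String, Long), MachineData]) { (state, e) =>
    state.updated((e.sensorId, e.eventTime), e)
  }

val deduped = lastWriteWins(Seq(
  MachineData("s1", 1000L, 1.0),
  MachineData("s1", 1000L, 2.0), // duplicate (sensorId, eventTime): wins
  MachineData("s2", 1000L, 3.0)
))
```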