From: Fabian Hueske
Date: Thu, 19 Jan 2017 21:36:32 +0100
Subject: Re: Operational concerns with state (was Re: Window limitations on groupBy)
To: user@flink.apache.org

Hi Raman,

Checkpoints are used to recover from task or process failures and, when checkpointing is enabled, are taken automatically at periodic intervals.
Checkpoints are usually removed once a more recent checkpoint completes (the exact retention policy can be configured).

Savepoints are used to restart a job that was previously shut down, to migrate a job to another cluster (e.g., when upgrading Flink), to update the job itself, etc., so they are intended more for planned maintenance.
Nonetheless, they can also be used for coarser-grained fault tolerance, and it is common practice to trigger a savepoint periodically.

These blog posts might be helpful to understand the potential of savepoints [1] [2].
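
As a concrete sketch (the interval and mode below are illustrative, not recommendations), periodic checkpointing is enabled on the execution environment in the Java API:

    import org.apache.flink.streaming.api.CheckpointingMode;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class CheckpointedJob {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env =
                StreamExecutionEnvironment.getExecutionEnvironment();
            // take a checkpoint every 60 seconds; older checkpoints are
            // discarded according to the configured retention policy
            env.enableCheckpointing(60_000L);
            env.getCheckpointConfig()
                .setCheckpointingMode(CheckpointingMode.EXACTLY_ONCE);
            // ... define the streaming job here, then:
            // env.execute("my-job");
        }
    }

Savepoints, by contrast, are triggered explicitly, e.g. with "bin/flink savepoint <jobId>" from the CLI, and a job can be resumed from one with "bin/flink run -s <savepointPath> <jarFile>" (the placeholders stand for your job id, savepoint path, and job jar).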

Best,
Fabian

[1] http://data-artisans.com/turning-back-time-savepoints/
[2] http://data-artisans.com/savepoints-part-2-updating-applications/

2017-01-19 19:02 GMT+01:00 Raman Gupta <rocketraman@gmail.com>:

I was able to get it working well with the original approach you described. Thanks! Note that the documentation on how to do this with the Java API is... sparse, to say the least. I was able to look at the implementation of the Scala flatMapWithState function as a starting point.

Now I'm trying to understand all the operational concerns related to the stored state. My checkpoints are in RocksDB, configured via the job definition.

It seems that the checkpointed state of the streaming job is lost when I stop and restart Flink normally, or when Flink terminates abnormally and is restarted. I was able to take an explicit savepoint and then restart the job from it.

Is the correct approach as of now to take savepoints periodically via cron, and use those to re-run jobs in case of Flink failure or restart?

Regards,
Raman

On 19/01/17 05:43 AM, Fabian Hueske wrote:
Hi Raman,

I think you would need a sliding count window of size 2 with slide 1.
This is basically a GlobalWindow with a special trigger.

However, you would need to modify the custom trigger to be able to
- identify a terminal event (if there is such a thing) or to
- close the window after a certain period of inactivity to clean up the state.
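
For example, the window itself could be declared like this (a sketch; the key field and window function are illustrative):

    // size 2, slide 1: each firing sees the previous and the current event
    // for the key, so the time between them can be computed
    stream
        .keyBy("sourceId")
        .countWindow(2, 1)
        .apply(new ComputeTimeInState()); // your custom WindowFunction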

Best, Fabian

2017-01-19 1:43 GMT+01:00 Raman Gupta <rocketraman@gmail.com>:
Thank you for your reply.

If I were to use a keyed stream with a count-based window of 2, would
Flink keep the last state persistently until the next state is
received? Would this be another way of having Flink keep this
information persistently without having to implement it manually?

Thanks,
Raman

On 18/01/17 11:22 AM, Fabian Hueske wrote:
> Hi Raman,
>
> I would approach this issue as follows.
>
> You key the input stream on the sourceId and apply a stateful
> FlatMapFunction.
> The FlatMapFunction has key-partitioned state and stores for each key
> (sourceId) the latest event as state.
> When a new event arrives, you can compute the time spent in the last
> state by looking up the previous event from the state and comparing it
> with the latest received event.
> Then you put the new event in the state.
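>
> A rough sketch of that in the Java API (the Event type with its state
> and timestamp fields is made up for illustration):
>
>     import org.apache.flink.api.common.functions.RichFlatMapFunction;
>     import org.apache.flink.api.common.state.ValueState;
>     import org.apache.flink.api.common.state.ValueStateDescriptor;
>     import org.apache.flink.api.java.tuple.Tuple2;
>     import org.apache.flink.configuration.Configuration;
>     import org.apache.flink.util.Collector;
>
>     // emits (previous state, time spent in it) per key
>     public class TimeInStateFunction
>             extends RichFlatMapFunction<Event, Tuple2<String, Long>> {
>
>         private transient ValueState<Event> lastEvent;
>
>         @Override
>         public void open(Configuration parameters) {
>             lastEvent = getRuntimeContext().getState(
>                 new ValueStateDescriptor<>("lastEvent", Event.class));
>         }
>
>         @Override
>         public void flatMap(Event current, Collector<Tuple2<String, Long>> out)
>                 throws Exception {
>             Event previous = lastEvent.value();
>             if (previous != null) {
>                 out.collect(Tuple2.of(
>                     previous.state, current.timestamp - previous.timestamp));
>             }
>             lastEvent.update(current);
>         }
>     }
>
> applied as events.keyBy("sourceId").flatMap(new TimeInStateFunction()).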
>
> This solution works well if you have a finite number of sources, or if
> you have a terminal event that signals that no more events will
> arrive for a key.
> Otherwise, the number of events stored in the state will grow
> without bound and eventually become a problem.
>
> If the number of sources increases, you need to evict data at some
> point in time. A ProcessFunction can help here, because you can
> register a timer which
> you can use to evict old state.
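>
> Sketched roughly (the one-hour timeout and the Event type are
> illustrative):
>
>     import org.apache.flink.api.common.state.ValueState;
>     import org.apache.flink.api.common.state.ValueStateDescriptor;
>     import org.apache.flink.configuration.Configuration;
>     import org.apache.flink.streaming.api.functions.ProcessFunction;
>     import org.apache.flink.util.Collector;
>
>     // forwards events and clears state for keys that stay inactive
>     public class EvictingFunction extends ProcessFunction<Event, Event> {
>
>         private static final long TIMEOUT = 60 * 60 * 1000L; // one hour
>         private transient ValueState<Long> lastSeen;
>
>         @Override
>         public void open(Configuration parameters) {
>             lastSeen = getRuntimeContext().getState(
>                 new ValueStateDescriptor<>("lastSeen", Long.class));
>         }
>
>         @Override
>         public void processElement(Event e, Context ctx, Collector<Event> out)
>                 throws Exception {
>             long now = ctx.timerService().currentProcessingTime();
>             lastSeen.update(now);
>             ctx.timerService().registerProcessingTimeTimer(now + TIMEOUT);
>             out.collect(e);
>         }
>
>         @Override
>         public void onTimer(long timestamp, OnTimerContext ctx,
>                 Collector<Event> out) throws Exception {
>             Long seen = lastSeen.value();
>             // only evict if no newer event re-armed the key in the meantime
>             if (seen != null && timestamp >= seen + TIMEOUT) {
>                 lastSeen.clear();
>             }
>         }
>     }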
>
> Hope this helps,
> Fabian
>
> 2017-01-18 15:39 GMT+01:00 Raman Gupta <rocketraman@gmail.com>:
>
> I am investigating Flink. I am considering a relatively simple use
> case -- I want to ingest streams of events that are essentially
> timestamped state changes. These events may look something like:
>
> {
>   sourceId: 111,
>   state: OPEN,
>   timestamp: <date/time>
> }
>
> I want to apply various processing to these state change events, the
> output of which can be used for analytics. For example:
>
> 1. average time spent in state, by state
> 2. sources with longest (or shortest) time spent in OPEN state
>
> The time spent in each state may be days or even weeks.
>
> All the examples I have seen of similar logic involve windows on the
> order of 15 minutes. Since time spent in each state may far exceed
> these window sizes, I'm wondering what the best approach will be.
>
> One thought from reading the docs is to use `every` to operate on the
> entire stream. But it seems like this will take longer and longer to
> run as the event stream grows, so this is not an ideal solution. Or
> does Flink apply some clever optimizations to avoid the potential
> performance issue?
>
> Another thought was to split the event stream into multiple streams by
> source, each of which will have a small (and limited) amount of data.
> This will make processing each stream simpler, but since there can be
> thousands of sources, it will result in a lot of streams to handle and
> persist (probably in Kafka). This does not seem ideal either.
>
> It seems like this should be simple, but I'm struggling with
> understanding how to solve it elegantly.
>
> Regards,
> Raman
>
>


