Subject: Re: Cleanup of OperatorStates?
From: Stephan Ewen <sewen@apache.org>
To: user@flink.apache.org
Date: Tue, 1 Dec 2015 18:34:03 +0100

Hi!

If you want to run with checkpoints (fault tolerance), you need to specify
a place to store the checkpoints to.

By default, it is the master's memory (or ZooKeeper in HA), so we put a
limit on the size of the state there.

To use larger state, simply configure a different place to store
checkpoints to, and you can grow your state as large as your memory
permits:

env.setStateBackend(new FsStateBackend("hdfs:///data/flink-checkpoints"));

or

env.setStateBackend(new FsStateBackend("file:///data/flink-checkpoints"));

More information on that is in the docs:
https://ci.apache.org/projects/flink/flink-docs-release-0.10/apis/state_backends.html

Greetings,
Stephan
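For context, a minimal sketch of how that configuration fits into a job,
assuming the Flink 0.10-era DataStream API; the checkpoint interval, the
path, and the import locations are illustrative and may differ between
versions:

import org.apache.flink.runtime.state.filesystem.FsStateBackend;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class CheckpointedJob {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // Take a checkpoint every 5 seconds (interval chosen for illustration).
        env.enableCheckpointing(5000);

        // Keep checkpoint state on a file system instead of in the master's
        // memory, so it is no longer capped by the small in-memory limit
        // (the ~5 MB default visible in the quoted exception below).
        env.setStateBackend(new FsStateBackend("file:///tmp/flink-checkpoints"));

        // Stand-in pipeline; a real job would build the sessionizing topology here.
        env.fromElements(1, 2, 3).print();

        env.execute("Job with file-system checkpoints");
    }
}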
On Tue, Dec 1, 2015 at 5:23 PM, Niels Basjes <Niels@basjes.nl> wrote:

> Hi,
>
> The first thing I noticed is that the Session object maintains a list of
> all events in memory.
> Your events are really small, yet in my scenario the predicted number of
> events per session will be above 1000 and each is expected to be in the
> 512-1024 byte range.
> This worried me, yet I decided to give your code a run.
>
> After a while running it in my IDE (not on a cluster) I got this:
>
> 17:18:46,336 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Triggering checkpoint 269 @ 1448986726336
> 17:18:46,587 INFO  org.apache.flink.runtime.taskmanager.Task - sessionization -> Sink: Unnamed (4/4) switched to FAILED with exception.
> java.lang.RuntimeException: Error triggering a checkpoint as the result of receiving checkpoint barrier
>     at org.apache.flink.streaming.runtime.tasks.StreamTask$1.onEvent(StreamTask.java:577)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask$1.onEvent(StreamTask.java:570)
>     at org.apache.flink.streaming.runtime.io.BarrierBuffer.processBarrier(BarrierBuffer.java:201)
>     at org.apache.flink.streaming.runtime.io.BarrierBuffer.getNextNonBlocked(BarrierBuffer.java:127)
>     at org.apache.flink.streaming.runtime.io.StreamInputProcessor.processInput(StreamInputProcessor.java:173)
>     at org.apache.flink.streaming.runtime.tasks.OneInputStreamTask.run(OneInputStreamTask.java:63)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:218)
>     at org.apache.flink.runtime.taskmanager.Task.run(Task.java:584)
>     at java.lang.Thread.run(Thread.java:745)
> Caused by: java.io.IOException: Size of the state is larger than the maximum permitted memory-backed state. Size=5246277, maxSize=5242880. Consider using a different state backend, like the File System State backend.
>     at org.apache.flink.runtime.state.memory.MemoryStateBackend.checkSize(MemoryStateBackend.java:130)
>     at org.apache.flink.runtime.state.memory.MemoryStateBackend.checkpointStateSerializable(MemoryStateBackend.java:108)
>     at com.dataartisans.streaming.sessionization.SessionizingOperator.snapshotOperatorState(SessionizingOperator.java:162)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask.triggerCheckpoint(StreamTask.java:440)
>     at org.apache.flink.streaming.runtime.tasks.StreamTask$1.onEvent(StreamTask.java:574)
>     ... 8 more
>
> Niels
>
> On Tue, Dec 1, 2015 at 4:41 PM, Niels Basjes <Niels@basjes.nl> wrote:
>
>> Thanks!
>> I'm going to study this code closely!
>>
>> Niels
>>
>> On Tue, Dec 1, 2015 at 2:50 PM, Stephan Ewen <sewen@apache.org> wrote:
>>
>>> Hi Niels!
>>>
>>> I have a pretty nice example for you here:
>>> https://github.com/StephanEwen/sessionization
>>>
>>> It keeps only one state and has the structure:
>>>
>>> (source) --> (window sessions) ---> (real time sink)
>>>                     |
>>>                     +--> (15 minute files)
>>>
>>> The real time sink gets the event with the attached visitId
>>> immediately. The session operator, as a side effect, writes out the
>>> 15 minute files with sessions that expired in that time.
>>>
>>> It is not a lot of code; the two main parts are:
>>>
>>> - the program and the program skeleton:
>>> https://github.com/StephanEwen/sessionization/blob/master/src/main/java/com/dataartisans/streaming/sessionization/EventTimeSessionization.java
>>> - the sessionizing and file writing operator:
>>> https://github.com/StephanEwen/sessionization/blob/master/src/main/java/com/dataartisans/streaming/sessionization/SessionizingOperator.java
>>>
>>> The example runs fully on event time, where the timestamps are
>>> extracted from the records. That makes this program very robust (no
>>> issues with clocks, etc.).
>>>
>>> Also, here comes the amazing part: the same program should do "replay"
>>> and real time. The only difference is what input you give it. Since
>>> time is event time, it can do both.
>>>
>>> One note:
>>> - Event time watermarks are the mechanism to signal progress in event
>>> time. It is simple here, because I assume that timestamps are ascending
>>> within a Kafka partition. If that is not the case, you need to
>>> implement a more elaborate TimestampExtractor.
>>>
>>> Hope you can work with this!
>>>
>>> Greetings,
>>> Stephan
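The ascending-timestamps assumption mentioned above boils down to
watermark logic like the following. This is a plain-Java sketch of the
idea only; the class and method names are made up and are not Flink's
TimestampExtractor interface:

// Illustrative only: tracks event timestamps and derives a watermark under
// the assumption that timestamps ascend within a partition.
public class AscendingWatermarkTracker {

    private long maxTimestampSeen = Long.MIN_VALUE;

    /** Record an event's timestamp as it passes through. */
    public void onEvent(long eventTimestamp) {
        // With ascending timestamps, the newest event carries the largest
        // timestamp, so the watermark can safely advance to it.
        maxTimestampSeen = Math.max(maxTimestampSeen, eventTimestamp);
    }

    /**
     * The watermark promises that no event with a smaller timestamp will
     * arrive anymore. With out-of-order data you would subtract an
     * allowed-lateness bound here instead.
     */
    public long currentWatermark() {
        return maxTimestampSeen;
    }
}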
>>>
>>> On Tue, Dec 1, 2015 at 1:00 PM, Stephan Ewen <sewen@apache.org> wrote:
>>>
>>>> Just for clarification: the real-time results should also contain the
>>>> visitId, correct?
>>>>
>>>> On Tue, Dec 1, 2015 at 12:06 PM, Stephan Ewen <sewen@apache.org> wrote:
>>>>
>>>>> Hi Niels!
>>>>>
>>>>> If you want to use the built-in windowing, you probably need two
>>>>> windows:
>>>>> - One for ID assignment (that immediately pipes elements through)
>>>>> - One for accumulating session elements, and then piping them into
>>>>> files upon session end.
>>>>>
>>>>> You may be able to use the rolling file sink (roll by 15 minutes) to
>>>>> store the files.
>>>>> That is probably the simplest to implement and will serve the real
>>>>> time case.
>>>>>
>>>>>                                     +--> (real time sink)
>>>>>                                     |
>>>>> (source) --> (window session ids) --+
>>>>>                                     |
>>>>>                                     +--> (window session) --> (rolling sink)
>>>>>
>>>>> You can put this all into one operator that accumulates the session
>>>>> elements but still immediately emits the new records (the realtime
>>>>> path), if you implement your own windowing/buffering in a custom
>>>>> function.
>>>>> This is also very easy to put onto event time then, which makes it
>>>>> valuable for processing the history (replay). For this second case I
>>>>> am still prototyping some code for the event time case; give me a
>>>>> bit, I'll get back to you...
>>>>>
>>>>> Greetings,
>>>>> Stephan
>>>>>
>>>>> On Tue, Dec 1, 2015 at 10:55 AM, Niels Basjes <Niels@basjes.nl> wrote:
>>>>>
>>>>>> Hi Stephan,
>>>>>>
>>>>>> I created a first version of the Visit ID assignment like this:
>>>>>>
>>>>>> First I group by sessionid and I create a Window per visit.
>>>>>> The custom Trigger for this window does a 'FIRE' after each element
>>>>>> and sets an EventTimer on the 'next possible moment the visit can
>>>>>> expire'.
>>>>>> To avoid getting 'all events' in the visit after every 'FIRE' I'm
>>>>>> using CountEvictor.of(1).
>>>>>> When the visit expires I do a PURGE. So if there are more events
>>>>>> afterwards for the same sessionId I get a new visit (which is
>>>>>> exactly what I want).
>>>>>>
>>>>>> The last step is that I want to have a 'normal' DataStream again to
>>>>>> work with.
>>>>>> I created this WindowFunction to map the windowed stream back to a
>>>>>> normal DataStream.
>>>>>> Essentially I do this:
>>>>>>
>>>>>> DataStream<Foo> visitDataStream = visitWindowedStream.apply(new WindowToStream<Foo>());
>>>>>>
>>>>>> // This is an identity 'apply'
>>>>>> private static class WindowToStream<T> implements WindowFunction<T, T, String, GlobalWindow> {
>>>>>>     @Override
>>>>>>     public void apply(String s, GlobalWindow window, Iterable<T> values, Collector<T> out) throws Exception {
>>>>>>         for (T value : values) {
>>>>>>             out.collect(value);
>>>>>>         }
>>>>>>     }
>>>>>> }
>>>>>>
>>>>>> The problem with this is that I first create the visitIds in a
>>>>>> Window (great).
>>>>>> Because I really need to have both the windowed events AND the near
>>>>>> realtime version, I currently break down the Window to get the
>>>>>> single events, and after that I have to recreate the same Window
>>>>>> again.
>>>>>>
>>>>>> I'm looking forward to the implementation direction you are
>>>>>> referring to. I hope you have a better way of doing this.
>>>>>>
>>>>>> Niels Basjes
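For illustration, the visit-ID rule described above (a new visit after a
gap of inactivity per sessionId) reduces to a few lines of plain Java,
independent of the windowing machinery; all names here are made up:

import java.util.HashMap;
import java.util.Map;
import java.util.UUID;

// Illustrative only: assigns a visit id per session id, starting a new
// visit whenever more than `gapMillis` passes between consecutive events.
public class VisitIdAssigner {

    private static final class Visit {
        String visitId;
        long lastEventTime;
    }

    private final long gapMillis;
    private final Map<String, Visit> visits = new HashMap<>();

    public VisitIdAssigner(long gapMillis) {
        this.gapMillis = gapMillis;
    }

    /** Returns the visit id for an event, opening a new visit after a gap. */
    public String visitIdFor(String sessionId, long eventTime) {
        Visit v = visits.get(sessionId);
        if (v == null || eventTime - v.lastEventTime > gapMillis) {
            v = new Visit();
            v.visitId = UUID.randomUUID().toString(); // a (sort of) random id
            visits.put(sessionId, v);
        }
        v.lastEventTime = eventTime;
        return v.visitId;
    }
}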
>>>>>>
>>>>>> On Mon, Nov 30, 2015 at 9:29 PM, Stephan Ewen <sewen@apache.org> wrote:
>>>>>>
>>>>>>> Hi Niels!
>>>>>>>
>>>>>>> Nice use case that you have!
>>>>>>> I think you can solve this super nicely with Flink, such that
>>>>>>> "replay" and "realtime" are literally the same program - they
>>>>>>> differ only in whether you read the Kafka topic from the beginning
>>>>>>> or attach to its end.
>>>>>>>
>>>>>>> Event time is, like you said, the key thing for "replay". Event
>>>>>>> time depends on the progress in the timestamps of the data, so it
>>>>>>> can progress at different speeds, depending on what the rate of
>>>>>>> your stream is.
>>>>>>> With the appropriate data source, it will progress very fast in
>>>>>>> "replay mode", so that you replay at "fast forward speed", and it
>>>>>>> progresses at the same speed as processing time when you attach to
>>>>>>> the end of the Kafka queue.
>>>>>>>
>>>>>>> When you define the time intervals in your program to react to
>>>>>>> event time progress, then you will compute the right sessionization
>>>>>>> in both replay and real time settings.
>>>>>>>
>>>>>>> I am writing a little example code to share. The type of
>>>>>>> ID-assignment sessions you want to do needs an undocumented API
>>>>>>> right now, so I'll prepare something there for you...
>>>>>>>
>>>>>>> Greetings,
>>>>>>> Stephan
>>>>>>>
>>>>>>> On Sun, Nov 29, 2015 at 4:04 PM, Niels Basjes <Niels@basjes.nl> wrote:
>>>>>>>
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> The sessionid is present in the measurements. It can also be seen
>>>>>>>> as a form of 'browser id'.
>>>>>>>> Most websites use either a 'long-lived random value in a cookie'
>>>>>>>> or an 'application session id' for this.
>>>>>>>>
>>>>>>>> So with the id of the browser in hand I have the need to group all
>>>>>>>> events into "periods of activity" which I call a visit.
>>>>>>>> Such a visit is a bounded subset of all events from a single
>>>>>>>> browser.
>>>>>>>>
>>>>>>>> What I need is to add a (sort of) random visit id to the events
>>>>>>>> that becomes 'inactive' after more than X minutes of inactivity.
>>>>>>>> I then want to add this visitid to each event and
>>>>>>>> 1) stream them out in realtime
>>>>>>>> 2) wait till the visit ends and store the complete visit on disk
>>>>>>>> (I am going for either Avro or Parquet).
>>>>>>>>
>>>>>>>> I want to create disk files with all visits that ended in a
>>>>>>>> specific time period. So essentially
>>>>>>>> "Group by round(<timestamp of last event>, 15 minutes)"
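That rounding is simple integer arithmetic on the timestamp; a tiny
sketch, assuming millisecond epoch timestamps and 15-minute buckets:

// Illustrative: map a visit's last-event timestamp (ms since epoch) to the
// start of its 15-minute bucket, so all visits ending in the same quarter
// hour land in the same output file.
public final class TimeBuckets {

    private static final long BUCKET_MS = 15 * 60 * 1000L;

    public static long bucketStart(long lastEventTimestampMs) {
        return (lastEventTimestampMs / BUCKET_MS) * BUCKET_MS;
    }

    public static void main(String[] args) {
        // The timestamp from the log above (17:18:46 local time) falls
        // into the 17:15 bucket.
        System.out.println(bucketStart(1448986726336L)); // -> 1448986500000
    }
}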
>>>>>>>>
>>>>>>>> Because of the need to be able to 'repair' things I came up with
>>>>>>>> the following question:
>>>>>>>> In the Flink API I see the 'processing time' (i.e. the actual time
>>>>>>>> of the server) and the 'event time' (i.e. the time when an event
>>>>>>>> was recorded).
>>>>>>>>
>>>>>>>> Now in my case all events are in Kafka (for, say, 2 weeks).
>>>>>>>> When something goes wrong I want to be able to 'reprocess'
>>>>>>>> everything from the start of the queue.
>>>>>>>> Here the matter of 'event time' becomes a big question for me; in
>>>>>>>> those 'replay' situations the event time will progress at a much
>>>>>>>> higher speed than the normal 1 sec/sec.
>>>>>>>>
>>>>>>>> How does this work in Apache Flink?
>>>>>>>>
>>>>>>>> Niels Basjes
>>>>>>>>
>>>>>>>> On Fri, Nov 27, 2015 at 3:28 PM, Stephan Ewen <sewen@apache.org> wrote:
>>>>>>>>
>>>>>>>>> Hey Niels!
>>>>>>>>>
>>>>>>>>> You may be able to implement this in windows anyway, depending on
>>>>>>>>> your setup. You can definitely implement state with timeout
>>>>>>>>> yourself (using the more low-level state interface), or you may
>>>>>>>>> be able to use custom windows for that (they can trigger on every
>>>>>>>>> element and return elements immediately, thereby giving you low
>>>>>>>>> latency).
>>>>>>>>>
>>>>>>>>> Can you tell me where exactly the session ID comes from? Is that
>>>>>>>>> something that the function with state generates itself?
>>>>>>>>> Depending on that answer, I can outline either the window way or
>>>>>>>>> the custom state way...
>>>>>>>>>
>>>>>>>>> Greetings,
>>>>>>>>> Stephan
>>>>>>>>>
>>>>>>>>> On Fri, Nov 27, 2015 at 2:19 PM, Niels Basjes <Niels@basjes.nl> wrote:
>>>>>>>>>
>>>>>>>>>> Hi,
>>>>>>>>>>
>>>>>>>>>> Thanks for the explanation.
>>>>>>>>>> I have clickstream data arriving in realtime and I need to
>>>>>>>>>> assign the visitId and stream it out again (with the visitId now
>>>>>>>>>> being part of the record) into Kafka with the lowest possible
>>>>>>>>>> latency.
>>>>>>>>>> Although the Window feature allows me to group and close the
>>>>>>>>>> visit on a timeout/expiry (as shown to me by Aljoscha in a
>>>>>>>>>> separate email), it does make a 'window'.
>>>>>>>>>>
>>>>>>>>>> So (as requested) I created a ticket for such a feature:
>>>>>>>>>> https://issues.apache.org/jira/browse/FLINK-3089
>>>>>>>>>>
>>>>>>>>>> Niels
>>>>>>>>>>
>>>>>>>>>> On Fri, Nov 27, 2015 at 11:51 AM, Stephan Ewen <sewen@apache.org> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hi Niels!
>>>>>>>>>>>
>>>>>>>>>>> Currently, state is released by setting the value for the key
>>>>>>>>>>> to null. If you are tracking web sessions, you can try to send
>>>>>>>>>>> an "end of session" element that sets the value to null.
>>>>>>>>>>>
>>>>>>>>>>> To be on the safe side, you probably want state that is
>>>>>>>>>>> automatically purged after a while. I would look into using
>>>>>>>>>>> windows for that. The triggers there are flexible, so you can
>>>>>>>>>>> schedule both actions on elements plus cleanup after a certain
>>>>>>>>>>> time delay (clock time or event time).
>>>>>>>>>>>
>>>>>>>>>>> The question about "state expiry" has come up a few times.
>>>>>>>>>>> People seem to like working on state directly, but it should
>>>>>>>>>>> clean up automatically.
>>>>>>>>>>>
>>>>>>>>>>> Can you see if your use case fits onto windows, and otherwise
>>>>>>>>>>> open a ticket for state expiry?
>>>>>>>>>>>
>>>>>>>>>>> Greetings,
>>>>>>>>>>> Stephan
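Sketched out, the release-by-null pattern could look roughly like this in
a keyed map function. The OperatorState interface is the one discussed in
this thread, but the getKeyValueState registration call and the
surrounding types are assumptions modeled on the Flink 0.10 era and may
differ between versions:

import org.apache.flink.api.common.functions.RichMapFunction;
import org.apache.flink.api.common.state.OperatorState;
import org.apache.flink.configuration.Configuration;

// ClickEvent and MyRecord are hypothetical placeholder types for this sketch.
public class SessionTracker extends RichMapFunction<ClickEvent, ClickEvent> {

    private OperatorState<MyRecord> sessionState;

    @Override
    public void open(Configuration parameters) throws Exception {
        // Assumed 0.10-era registration call; the default value is null.
        sessionState = getRuntimeContext().getKeyValueState("session", MyRecord.class, null);
    }

    @Override
    public ClickEvent map(ClickEvent event) throws Exception {
        if (event.endOfSession) {
            // Setting the value for this key back to null releases the state.
            sessionState.update(null);
        } else {
            MyRecord record = sessionState.value();
            if (record == null) {
                record = new MyRecord();
            }
            record.eventCount++;
            sessionState.update(record);
        }
        return event;
    }
}

class ClickEvent {
    public String sessionId;
    public boolean endOfSession;
}

class MyRecord {
    public long eventCount;
}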
>>>>>>>>>>>
>>>>>>>>>>> On Thu, Nov 26, 2015 at 10:42 PM, Niels Basjes <Niels@basjes.nl> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Hi,
>>>>>>>>>>>>
>>>>>>>>>>>> I'm working on a streaming application that ingests
>>>>>>>>>>>> clickstream data.
>>>>>>>>>>>> In a specific part of the flow I need to retain a little bit
>>>>>>>>>>>> of state per visitor (i.e. keyBy(sessionid)).
>>>>>>>>>>>>
>>>>>>>>>>>> So I'm using the Key/Value state interface (i.e.
>>>>>>>>>>>> OperatorState<MyRecord>) in a map function.
>>>>>>>>>>>>
>>>>>>>>>>>> Now in my application I expect to get a huge number of
>>>>>>>>>>>> sessions per day.
>>>>>>>>>>>> Since these sessionids are 'random' and become unused after
>>>>>>>>>>>> the visitor leaves the website, over time the system will have
>>>>>>>>>>>> seen millions of those sessionids.
>>>>>>>>>>>>
>>>>>>>>>>>> So I was wondering: how are these OperatorStates cleaned up?
>>>>>>>>>>>>
>>>>>>>>>>>> --
>>>>>>>>>>>> Best regards / Met vriendelijke groeten,
>>>>>>>>>>>>
>>>>>>>>>>>> Niels Basjes
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> Best regards / Met vriendelijke groeten,
>>>>>>>>>>
>>>>>>>>>> Niels Basjes
>>>>>>>>>
>>>>>>>>
>>>>>>>> --
>>>>>>>> Best regards / Met vriendelijke groeten,
>>>>>>>>
>>>>>>>> Niels Basjes
>>>>>>>
>>>>>>
>>>>>> --
>>>>>> Best regards / Met vriendelijke groeten,
>>>>>>
>>>>>> Niels Basjes
>>>>>
>>>>
>>>
>>
>> --
>> Best regards / Met vriendelijke groeten,
>>
>> Niels Basjes
>
> --
> Best regards / Met vriendelijke groeten,
>
> Niels Basjes