From: Till Rohrmann
Date: Thu, 10 Nov 2016 11:13:34 +0100
Subject: Re: Why did the Flink Cluster JM crash?
To: amir bahmanyari
Cc: "user@flink.apache.org"

The amount of data should be fine. Try to set the number of slots to the
number of cores you have available. As long as you have more Kafka
partitions than Flink Kafka consumers (subtasks) you should be fine. But I
think you can also decrease the number of Kafka partitions a little bit; I
guess an excessive number of partitions also comes with a price. But I'm no
expert there.

Hope your experiments run well with these settings.

Cheers,
Till

On Wed, Nov 9, 2016 at 8:02 PM, amir bahmanyari wrote:

> Thanks Till.
> I have been trying out many configuration combinations to get to the
> peak of what I can achieve as reasonable performance.
> And yes, when I drop the number of slots, I don't get an OOM. However, I
> don't get the response I want either.
> The amount of data I send is fairly large: about 105 GB, sent over a
> stretch of 3.5 hours to a 4-node cluster running my Beam app, receiving
> from a 2-node Kafka cluster.
> From what I understand, you are suggesting that to get the best
> performance, the total number of slots should equal the total number
> of cores distributed across the cluster.
> To make sure we have done that, I will go back and repeat the testing
> with that in mind.
> FYI, there are 4096 Kafka partitions; roughly 1024 per 16-core node.
> Is this reasonable?
> Once I know the answer to this question, I will go ahead and readjust my
> config and repeat the test.
> I appreciate your response.
> Amir-
>
> ------------------------------
> *From:* Till Rohrmann
> *To:* amir bahmanyari
> *Cc:* "user@flink.apache.org"
> *Sent:* Wednesday, November 9, 2016 1:27 AM
> *Subject:* Re: Why did the Flink Cluster JM crash?
>
> Hi Amir,
>
> I fear that 900 slots per task manager is a bit too many unless your
> machine has 900 cores. As a rule of thumb you should allocate as many
> slots as your machines have cores. Maybe you could try to decrease the
> number of slots and see if you still observe an OOM error.
>
> Cheers,
> Till
>
> On Wed, Nov 9, 2016 at 12:10 AM, amir bahmanyari wrote:
>
> Ok. There is an OOM exception... but this used to work fine with the
> same configuration.
> There are four nodes: beam1 through beam4.
> The Kafka partitions (4096) exceed the degree of parallelism (3584).
>
> jobmanager.rpc.address: beam1
> jobmanager.rpc.port: 6123
> jobmanager.heap.mb: 1024
> taskmanager.heap.mb: 102400
> taskmanager.numberOfTaskSlots: 896
> taskmanager.memory.preallocate: false
>
> parallelism.default: 3584
>
> Thanks for your valuable time Till.
>
> AnonymousParDo -> AnonymousParDo (3584/3584) (ebe8da5bda017ee31ad774c5bc5e5e88) switched from DEPLOYING to RUNNING
> 2016-11-08 22:51:44,471 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Read(UnboundedKafkaSource) -> AnonymousParDo -> AnonymousParDo (3573/3584) (ddf5a8939c1fc4ad1e6d71f17fe5ab0b) switched from RUNNING to FAILED
> 2016-11-08 22:51:44,474 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Read(UnboundedKafkaSource) -> AnonymousParDo -> AnonymousParDo (1/3584) (865c54432153a0230e62bf7610118ff8) switched from RUNNING to CANCELING
> 2016-11-08 22:51:44,474 INFO  org.apache.flink.runtime.jobmanager.JobManager - Status of job e61cada683c0f7a709101c26c2c9a17c (benchbeamrunners-abahman-1108225128) changed to FAILING.
> java.lang.OutOfMemoryError: unable to create new native thread
>         at java.lang.Thread.start0(Native Method)
>         at java.lang.Thread.start(Thread.java:714)
>         at java.util.concurrent.ThreadPoolExecutor.addWorker(ThreadPoolExecutor.java:950)
>         at java.util.concurrent.ThreadPoolExecutor.ensurePrestart(ThreadPoolExecutor.java:1587)
>         at java.util.concurrent.ScheduledThreadPoolExecutor.delayedExecute(ScheduledThreadPoolExecutor.java:334)
>         at java.util.concurrent.ScheduledThreadPoolExecutor.schedule(ScheduledThreadPoolExecutor.java:533)
>         at java.util.concurrent.Executors$DelegatedScheduledExecutorService.schedule(Executors.java:729)
>         at org.apache.flink.streaming.runtime.tasks.StreamTask.registerTimer(StreamTask.java:652)
>         at org.apache.flink.streaming.api.operators.AbstractStreamOperator.registerTimer(AbstractStreamOperator.java:250)
>         at org.apache.flink.streaming.api.operators.StreamingRuntimeContext.registerTimer(StreamingRuntimeContext.java:92)
>         at org.apache.beam.runners.flink.translation.wrappers.streaming.io.UnboundedSourceWrapper.setNextWatermarkTimer(UnboundedSourceWrapper.java:381)
>         at org.apache.beam.runners.flink.translation.wrappers.streaming.io.UnboundedSourceWrapper.run(UnboundedSourceWrapper.java:233)
>         at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:78)
>         at org.apache.flink.streaming.runtime.tasks.SourceStreamTask.run(SourceStreamTask.java:56)
>         at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:224)
>         at org.apache.flink.runtime.taskmanager.Task.run(Task.java:559)
>         at java.lang.Thread.run(Thread.java:745)
>
> ------------------------------
> *From:* Till Rohrmann
> *To:* user@flink.apache.org; amir bahmanyari
> *Sent:* Tuesday, November 8, 2016 2:11 PM
> *Subject:* Re: Why did the Flink Cluster JM crash?
>
> Hi Amir,
>
> what does the JM log say?
>
> Cheers,
> Till
>
> On Tue, Nov 8, 2016 at 9:33 PM, amir bahmanyari wrote:
>
> Hi colleagues,
> I started the cluster fine, and started the Beam app running in the
> Flink cluster fine.
> The dashboard showed all tasks deployed and open for business.
> I started sending data to the Beam app, and all of a sudden the Flink
> JM crashed.
> Exceptions below.
> Thanks+regards
> Amir
>
> java.lang.RuntimeException: Pipeline execution failed
>         at org.apache.beam.runners.flink.FlinkRunner.run(FlinkRunner.java:113)
>         at org.apache.beam.runners.flink.FlinkRunner.run(FlinkRunner.java:48)
>         at org.apache.beam.sdk.Pipeline.run(Pipeline.java:183)
>         at benchmark.flinkspark.flink.BenchBeamRunners.main(BenchBeamRunners.java:622)  // p.run();
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:498)
>         at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:505)
>         at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:403)
>         at org.apache.flink.client.program.Client.runBlocking(Client.java:248)
>         at org.apache.flink.client.CliFrontend.executeProgramBlocking(CliFrontend.java:866)
>         at org.apache.flink.client.CliFrontend.run(CliFrontend.java:333)
>         at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1189)
>         at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1239)
> Caused by: org.apache.flink.client.program.ProgramInvocationException:
> The program execution failed: Communication with JobManager failed:
> Lost connection to the JobManager.
>         at org.apache.flink.client.program.Client.runBlocking(Client.java:381)
>         at org.apache.flink.client.program.Client.runBlocking(Client.java:355)
>         at org.apache.flink.streaming.api.environment.StreamContextEnvironment.execute(StreamContextEnvironment.java:65)
>         at org.apache.beam.runners.flink.FlinkPipelineExecutionEnvironment.executePipeline(FlinkPipelineExecutionEnvironment.java:118)
>         at org.apache.beam.runners.flink.FlinkRunner.run(FlinkRunner.java:110)
>         ... 14 more
> Caused by: org.apache.flink.runtime.client.JobExecutionException:
> Communication with JobManager failed: Lost connection to the JobManager.
>         at org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:140)
>         at org.apache.flink.client.program.Client.runBlocking(Client.java:379)
>         ... 18 more
> Caused by: org.apache.flink.runtime.client.JobClientActorConnectionTimeoutException:
> Lost connection to the JobManager.
>         at org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:244)
>         at org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:88)
>         at org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:68)
>         at akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
>         at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
>         at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97)
>         at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
>         at akka.actor.ActorCell.invoke(ActorCell.scala:487)
>         at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
>         at akka.dispatch.Mailbox.run(Mailbox.scala:221)
>         at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
>         at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>         at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253)
>         at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346)
>         at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>         at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
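
[Editor's note on the thread above] The "unable to create new native thread" error is typically an OS-level thread limit being hit rather than heap exhaustion: each streaming subtask runs on its own thread, and sources add Kafka consumer and watermark-timer threads on top, so 896 slots per TaskManager multiplies quickly. A back-of-the-envelope sketch of why the slots-per-core rule of thumb avoids this; the threads-per-slot and JVM-overhead figures are illustrative assumptions, not Flink constants:

```python
# Rough estimate of native threads one TaskManager process must create.
# threads_per_slot and jvm_overhead are illustrative assumptions, not
# Flink constants: each running subtask needs at least one thread, and
# sources spawn additional Kafka consumer and timer threads.

def estimated_threads(slots_per_tm: int, threads_per_slot: int = 4,
                      jvm_overhead: int = 50) -> int:
    """Back-of-the-envelope native-thread count for one TaskManager."""
    return slots_per_tm * threads_per_slot + jvm_overhead

# Configuration from this thread: taskmanager.numberOfTaskSlots: 896
print(estimated_threads(896))  # 3634 threads per process

# Rule of thumb from the thread: one slot per core (16-core nodes here)
print(estimated_threads(16))   # 114 threads per process
```

Whether the process can actually create that many threads is bounded by the per-user process limit (`ulimit -u` on Linux) and by native stack memory, which is why the failure surfaces as an OutOfMemoryError even when plenty of heap is free.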