flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fabian Hueske <fhue...@gmail.com>
Subject Re: Fwd: HA : My job didn't restart even if task manager restarted.
Date Fri, 08 Sep 2017 15:38:25 GMT

sorry for the late response!
I'm not familiar with the details of the failure recovery but Till (in CC)
knows the code in depth.
Maybe he can figure out what's going on.

Best, Fabian

2017-09-06 5:35 GMT+02:00 sunny yun <seonhee.yun@gmail.com>:

> I am still struggling to solve this problem.
> I have no doubt that the JOB should automatically restart after restarting
> the TASK MANAGER in YARN MODE. Is it a misunderstood?
> Problem seems that *JOB MANAGER still try to connect to old TASK MANAGER
> even after new TASK MANAGER container be created.*
> When I killed TM on node#2 then new TM container is created on node#3, but
> JM still tries to connect to TM on node#2 according to the log file. (It
> was
> not a log I posted before, when I found it while continuing the test.
> Normally the TM be created on the same node after killed.)
> So new TM don't know JOB info and JM show us JOB with fail status.
> If anyone has succeeded in the same situation(YARN + TM FAILURE), please
> just tell me.
> That will be big help to me.
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.
> n4.nabble.com/

View raw message