flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Renjie Liu <liurenjie2...@gmail.com>
Subject Re: Task manager number mismatch container number on mesos
Date Mon, 10 Apr 2017 06:05:48 GMT
This happens again.
I've checked job manager's log and it reports the lost of task manager as
expected.
However, there's nothing valuable in the task manager's log. I've checked
the output of jstack and what's interesting is that several threads get
blocked when allocating memory. But the jvm heap usage is low and no gc
happens.






On Thu, Mar 23, 2017 at 10:24 PM Renjie Liu <liurenjie2008@gmail.com> wrote:

I'm not sure how to reproduce this bug, and I'll post it next time it
happens.

On Thu, Mar 23, 2017 at 10:21 PM Robert Metzger <rmetzger@apache.org> wrote:

Could you provide the logs of the task manager that still runs as a
container but doesn't show up as a Taskmanager?

On Thu, Mar 23, 2017 at 11:38 AM, Renjie Liu <liurenjie2008@gmail.com>
wrote:

Permanent. I've waited for several minutes and the task manager is still
lost.

On Thu, Mar 23, 2017 at 6:34 PM Ufuk Celebi <uce@apache.org> wrote:

When it happens, is it temporary or permanent?

Looping in Till and Eron who worked on the Mesos runner.

– Ufuk

On Thu, Mar 23, 2017 at 11:09 AM, Renjie Liu <liurenjie2008@gmail.com>
wrote:
> Hi, all:
> We are using flink 1.2.0 on mesos. We found the number of task managers
> mismatches with container number occasinally. That's the mesos container
> still exists but it can't be found on the monitor web page of flink
master.
> This case doesn't happen frequently and it's hard to reproduce.
> --
> Liu, Renjie
> Software Engineer, MVAD

-- 
Liu, Renjie
Software Engineer, MVAD


-- 
Liu, Renjie
Software Engineer, MVAD

-- 
Liu, Renjie
Software Engineer, MVAD

Mime
View raw message