Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 68710200C40 for ; Thu, 23 Mar 2017 15:21:16 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 67269160B75; Thu, 23 Mar 2017 14:21:16 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id ADEC3160B6F for ; Thu, 23 Mar 2017 15:21:15 +0100 (CET) Received: (qmail 87146 invoked by uid 500); 23 Mar 2017 14:21:14 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.apache.org Delivered-To: mailing list user@flink.apache.org Received: (qmail 87137 invoked by uid 99); 23 Mar 2017 14:21:14 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Mar 2017 14:21:14 +0000 Received: from mail-wm0-f45.google.com (mail-wm0-f45.google.com [74.125.82.45]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 06CB11A07BD for ; Thu, 23 Mar 2017 14:21:13 +0000 (UTC) Received: by mail-wm0-f45.google.com with SMTP id n11so64285228wma.0 for ; Thu, 23 Mar 2017 07:21:13 -0700 (PDT) X-Gm-Message-State: AFeK/H2F/fXahqMfZgzs0dQ2NQHGruNatuptON/x9xbDK2+dM4snC2vt+fts5qb1X+fte3rH3laOmQVlFYmtMQ== X-Received: by 10.28.138.140 with SMTP id m134mr13798520wmd.134.1490278872794; Thu, 23 Mar 2017 07:21:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.80.181.59 with HTTP; Thu, 23 Mar 2017 07:20:52 -0700 (PDT) In-Reply-To: References: From: Robert Metzger Date: Thu, 23 Mar 2017 15:20:52 +0100 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Task manager number mismatch container number on mesos To: "user@flink.apache.org" Cc: Till Rohrmann , ewright@live.com Content-Type: multipart/alternative; boundary=001a11443602d44b68054b6696b7 archived-at: Thu, 23 Mar 2017 14:21:16 -0000 --001a11443602d44b68054b6696b7 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Could you provide the logs of the task manager that still runs as a container but doesn't show up as a Taskmanager? On Thu, Mar 23, 2017 at 11:38 AM, Renjie Liu wrote: > Permanent. I've waited for several minutes and the task manager is still > lost. > > On Thu, Mar 23, 2017 at 6:34 PM Ufuk Celebi wrote: > >> When it happens, is it temporary or permanent? >> >> Looping in Till and Eron who worked on the Mesos runner. >> >> =E2=80=93 Ufuk >> >> On Thu, Mar 23, 2017 at 11:09 AM, Renjie Liu >> wrote: >> > Hi, all: >> > We are using flink 1.2.0 on mesos. We found the number of task manager= s >> > mismatches with container number occasinally. That's the mesos contain= er >> > still exists but it can't be found on the monitor web page of flink >> master. >> > This case doesn't happen frequently and it's hard to reproduce. >> > -- >> > Liu, Renjie >> > Software Engineer, MVAD >> > -- > Liu, Renjie > Software Engineer, MVAD > --001a11443602d44b68054b6696b7 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Could you provide the logs of the task manager that still = runs as a container but doesn't show up as a Taskmanager?

On Thu, Mar 23, 2017 at 1= 1:38 AM, Renjie Liu <liurenjie2008@gmail.com> wrote:
Permanent. I've waite= d for several minutes and the task manager is still lost.

On Thu, Mar 23, 2017 at 6:34 PM Ufuk Celebi <uce@apache.org> wrote:
When it happens, is it temporary or permanent?

Looping in Till and Eron who worked on the Mesos runner.

=E2=80=93 Ufuk

On Thu, Mar 23, 2017 at 11:09 AM, Renjie Liu <liurenjie2008@gmail.com> wrote:
> Hi, all:
> We are using flink 1.2.0 on mesos. We found the number of task manager= s
> mismatches with container number occasinally. That's the mesos con= tainer
> still exists but it can't be found on the monitor web page of flin= k master.
> This case doesn't happen frequently and it's hard to reproduce= .
> --
> Liu, Renjie
> Software Engineer, MVAD
--
Liu, Renjie
Software Engineer, MVA= D

--001a11443602d44b68054b6696b7--