Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3646E19F06 for ; Wed, 16 Mar 2016 18:37:34 +0000 (UTC) Received: (qmail 57036 invoked by uid 500); 16 Mar 2016 18:37:33 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 56985 invoked by uid 500); 16 Mar 2016 18:37:33 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 56961 invoked by uid 99); 16 Mar 2016 18:37:33 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 16 Mar 2016 18:37:33 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 8475B2C1F58 for ; Wed, 16 Mar 2016 18:37:33 +0000 (UTC) Date: Wed, 16 Mar 2016 18:37:33 +0000 (UTC) From: "Siqi Li (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-4831) Recovered containers will be killed after NM stateful restart MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/YARN-4831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197895#comment-15197895 ] Siqi Li commented on YARN-4831: ------------------------------- When NM does a stateful restart, the ContainerManagerImpl will try to recover applications, and containers, and then send out ApplicationFinishEvent to apps that in appsState.getFinishedApplications(). The ApplicationFinishEvent could result in newly recovered containers to transit from NEW to DONE with a KillOnNewTransition. We could add an additional check in KillOnNewTransition to avoid killing completed containers. > Recovered containers will be killed after NM stateful restart > -------------------------------------------------------------- > > Key: YARN-4831 > URL: https://issues.apache.org/jira/browse/YARN-4831 > Project: Hadoop YARN > Issue Type: Bug > Reporter: Siqi Li > > {code} > 2016-03-04 19:43:48,130 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container: Container container_1456335621285_0040_01_000066 transitioned from NEW to DONE > 2016-03-04 19:43:48,130 INFO org.apache.hadoop.yarn.server.nodemanager.NMAuditLogger: USER=henkins-service OPERATION=Container Finished - Killed TARGET=ContainerImpl RESULT=SUCCESS APPID=application_1456335621285_0040 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)