Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 65BA5EA0B for ; Fri, 1 Mar 2013 05:57:18 +0000 (UTC) Received: (qmail 14923 invoked by uid 500); 1 Mar 2013 05:57:18 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 13964 invoked by uid 500); 1 Mar 2013 05:57:16 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 13524 invoked by uid 99); 1 Mar 2013 05:57:15 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Mar 2013 05:57:15 +0000 Date: Fri, 1 Mar 2013 05:57:15 +0000 (UTC) From: "Siddharth Seth (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590260#comment-13590260 ] Siddharth Seth commented on YARN-376: ------------------------------------- The eclipse failures is not related. Committing this. > Apps that have completed can appear as RUNNING on the NM UI > ----------------------------------------------------------- > > Key: YARN-376 > URL: https://issues.apache.org/jira/browse/YARN-376 > Project: Hadoop YARN > Issue Type: Bug > Components: resourcemanager > Affects Versions: 2.0.3-alpha, 0.23.6 > Reporter: Jason Lowe > Assignee: Jason Lowe > Priority: Blocker > Attachments: YARN-376_branch-0.23.txt, YARN-376.patch, YARN-376.patch, YARN-376.patch, YARN-376.patch, YARN-376-trunk.txt > > > On a busy cluster we've noticed a growing number of applications appear as RUNNING on a nodemanager web pages but the applications have long since finished. Looking at the NM logs, it appears the RM never told the nodemanager that the application had finished. This is also reflected in a jstack of the NM process, since many more log aggregation threads are running then one would expect from the number of actively running applications. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira