Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 31050 invoked from network); 5 May 2010 20:07:24 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 5 May 2010 20:07:24 -0000 Received: (qmail 91090 invoked by uid 500); 5 May 2010 20:07:24 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 91060 invoked by uid 500); 5 May 2010 20:07:24 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 91052 invoked by uid 99); 5 May 2010 20:07:24 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 20:07:24 +0000 X-ASF-Spam-Status: No, hits=-1393.4 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 20:07:24 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o45K73l4001738 for ; Wed, 5 May 2010 20:07:03 GMT Message-ID: <30942496.29151273090023510.JavaMail.jira@thor> Date: Wed, 5 May 2010 16:07:03 -0400 (EDT) From: "Todd Lipcon (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Commented: (MAPREDUCE-1755) Zombie tasks kept alive by logging system In-Reply-To: <6035765.24991273079044480.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-1755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864479#action_12864479 ] Todd Lipcon commented on MAPREDUCE-1755: ---------------------------------------- Do you have a jstack from the JVM? > Zombie tasks kept alive by logging system > ----------------------------------------- > > Key: MAPREDUCE-1755 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1755 > Project: Hadoop Map/Reduce > Issue Type: Bug > Affects Versions: 0.20.2 > Reporter: Allen Wittenauer > Attachments: stderr.txt, syslog.txt, tightloop.txt > > > I'm currently looking at a task that, as far as the task tracker is concerned, is dead. Like long long long ago dead. It was a failed task that ran out of heap. Rather than just kill it, I thought I would see what it was doing, since it was clearly using system resources. It would appear the system is trying to log but failing. I'm guessing we're missing an error condition and not doing the appropriate thing. See the comments for more. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.