From mapreduce-issues-return-13339-apmail-hadoop-mapreduce-issues-archive=hadoop.apache.org@hadoop.apache.org Wed May 05 20:21:31 2010 Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 34468 invoked from network); 5 May 2010 20:21:31 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 5 May 2010 20:21:31 -0000 Received: (qmail 16450 invoked by uid 500); 5 May 2010 20:21:31 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 16423 invoked by uid 500); 5 May 2010 20:21:31 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 16412 invoked by uid 99); 5 May 2010 20:21:31 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 20:21:31 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 20:21:28 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o45KL7Bg001923 for ; Wed, 5 May 2010 20:21:07 GMT Message-ID: <18242477.29661273090867084.JavaMail.jira@thor> Date: Wed, 5 May 2010 16:21:07 -0400 (EDT) From: "Allen Wittenauer (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Updated: (MAPREDUCE-1755) Zombie tasks kept alive by logging system In-Reply-To: <6035765.24991273079044480.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/MAPREDUCE-1755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allen Wittenauer updated MAPREDUCE-1755: ---------------------------------------- Attachment: jstack.txt As reqeusted, a jstack. > Zombie tasks kept alive by logging system > ----------------------------------------- > > Key: MAPREDUCE-1755 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1755 > Project: Hadoop Map/Reduce > Issue Type: Bug > Affects Versions: 0.20.2 > Reporter: Allen Wittenauer > Attachments: jstack.txt, stderr.txt, syslog.txt, tightloop.txt > > > I'm currently looking at a task that, as far as the task tracker is concerned, is dead. Like long long long ago dead. It was a failed task that ran out of heap. Rather than just kill it, I thought I would see what it was doing, since it was clearly using system resources. It would appear the system is trying to log but failing. I'm guessing we're missing an error condition and not doing the appropriate thing. See the comments for more. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.