Return-Path: Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: (qmail 56393 invoked from network); 30 Jul 2009 06:56:40 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 30 Jul 2009 06:56:40 -0000 Received: (qmail 83414 invoked by uid 500); 30 Jul 2009 06:56:39 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 83327 invoked by uid 500); 30 Jul 2009 06:56:39 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 83297 invoked by uid 99); 30 Jul 2009 06:56:38 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Jul 2009 06:56:38 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Jul 2009 06:56:35 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id C789F234C044 for ; Wed, 29 Jul 2009 23:56:14 -0700 (PDT) Message-ID: <1904938242.1248936974806.JavaMail.jira@brutus> Date: Wed, 29 Jul 2009 23:56:14 -0700 (PDT) From: "Nathan Marz (JIRA)" To: common-dev@hadoop.apache.org Subject: [jira] Resolved: (HADOOP-5330) Zombie tasks remain after jobs finish/fail/get killed MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-5330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nathan Marz resolved HADOOP-5330. --------------------------------- Resolution: Invalid As it turns out, this was caused by an application problem. We were running embedded Solr instances in the tasks that were preventing the process from exiting. The fix was to close the Solr instances at task completion. > Zombie tasks remain after jobs finish/fail/get killed > ----------------------------------------------------- > > Key: HADOOP-5330 > URL: https://issues.apache.org/jira/browse/HADOOP-5330 > Project: Hadoop Common > Issue Type: Bug > Affects Versions: 0.19.1 > Reporter: Nathan Marz > > I'm seeing a lot of "task attempts" around our hadoop cluster for jobs that are no longer around. The attempts seem to be "hung", as they sit there forever. Additionally, they seem to take up map and reduce slots in the cluster unless MapReduce is restarted. This causes real jobs to be unable to utilize the whole cluster. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.