Return-Path: X-Original-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 40B8895C0 for ; Wed, 23 May 2012 20:09:43 +0000 (UTC) Received: (qmail 92608 invoked by uid 500); 23 May 2012 20:09:42 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 92545 invoked by uid 500); 23 May 2012 20:09:42 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 92523 invoked by uid 99); 23 May 2012 20:09:42 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 May 2012 20:09:42 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 8091C14281C for ; Wed, 23 May 2012 20:09:42 +0000 (UTC) Date: Wed, 23 May 2012 20:09:42 +0000 (UTC) From: "Thomas Graves (JIRA)" To: mapreduce-issues@hadoop.apache.org Message-ID: <865459993.12931.1337803782530.JavaMail.jiratomcat@issues-vm> In-Reply-To: <1095474367.22350.1334330897389.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (MAPREDUCE-4152) map task left hanging after AM dies trying to connect to RM MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-4152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated MAPREDUCE-4152: ------------------------------------- Status: Open (was: Patch Available) > map task left hanging after AM dies trying to connect to RM > ----------------------------------------------------------- > > Key: MAPREDUCE-4152 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-4152 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: mrv2 > Affects Versions: 0.23.2 > Reporter: Thomas Graves > Assignee: Thomas Graves > Attachments: MAPREDUCE-4152.patch, MAPREDUCE-4152.patch, MAPREDUCE-4152.patch > > > We had an instance where the RM went down for more then an hour. The application master exited with "Could not contact RM after 360000 milliseconds" > 2012-04-11 10:43:36,040 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1333003059741_15999Job Transitioned from RUNNING to ERROR -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira