Return-Path: X-Original-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8F372D37A for ; Thu, 23 Aug 2012 18:32:43 +0000 (UTC) Received: (qmail 95508 invoked by uid 500); 23 Aug 2012 18:32:43 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 95428 invoked by uid 500); 23 Aug 2012 18:32:43 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 95316 invoked by uid 99); 23 Aug 2012 18:32:43 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Aug 2012 18:32:43 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 963142C0A57 for ; Thu, 23 Aug 2012 18:32:42 +0000 (UTC) Date: Fri, 24 Aug 2012 05:32:42 +1100 (NCT) From: "Aaron T. Myers (JIRA)" To: mapreduce-issues@hadoop.apache.org Message-ID: <172528053.6711.1345746762616.JavaMail.jiratomcat@arcas> In-Reply-To: <1516160795.9317.1299708299616.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (MAPREDUCE-2374) "Text File Busy" errors launching MR tasks MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated MAPREDUCE-2374: -------------------------------------- Resolution: Fixed Fix Version/s: (was: 0.22.1) 2.2.0-alpha 1.2.0 Assignee: Andy Isaacson (was: Todd Lipcon) Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) I've just committed Andy's latest patches to to branch-1, branch-2, and trunk. Thanks a lot for the contribution, Andy, and thanks a lot to everyone for the discussion to get to the bottom of this issue. > "Text File Busy" errors launching MR tasks > ------------------------------------------ > > Key: MAPREDUCE-2374 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-2374 > Project: Hadoop Map/Reduce > Issue Type: Bug > Affects Versions: 0.22.0 > Reporter: Todd Lipcon > Assignee: Andy Isaacson > Fix For: 1.2.0, 2.2.0-alpha > > Attachments: failed_taskjvmsh.strace, mapreduce-2374-2.txt, mapreduce-2374-branch-1.patch, mapreduce-2374-on-20sec.txt, mapreduce-2374.txt, mapreduce-2374.txt, mapreduce-2374.txt, successfull_taskjvmsh.strace > > > Some very small percentage of tasks fail with a "Text file busy" error. > The following was the original diagnosis: > {quote} > Our use of PrintWriter in TaskController.writeCommand is unsafe, since that class swallows all IO exceptions. We're not currently checking for errors, which I'm seeing result in occasional task failures with the message "Text file busy" - assumedly because the close() call is failing silently for some reason. > {quote} > .. but turned out to be another issue as well (see below) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira