Return-Path: X-Original-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 90D3A86EA for ; Thu, 18 Aug 2011 21:14:52 +0000 (UTC) Received: (qmail 81386 invoked by uid 500); 18 Aug 2011 21:14:51 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 80799 invoked by uid 500); 18 Aug 2011 21:14:50 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 80780 invoked by uid 99); 18 Aug 2011 21:14:49 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Aug 2011 21:14:49 +0000 X-ASF-Spam-Status: No, hits=-2001.1 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Aug 2011 21:14:48 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 77A8AC320F for ; Thu, 18 Aug 2011 21:14:28 +0000 (UTC) Date: Thu, 18 Aug 2011 21:14:28 +0000 (UTC) From: "Bharath Mundlapudi (JIRA)" To: mapreduce-issues@hadoop.apache.org Message-ID: <68240356.50547.1313702068480.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <2072931558.41877.1313512780128.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (MAPREDUCE-2846) approx 10% of all tasks fail with DefaultTaskController MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13087290#comment-13087290 ] Bharath Mundlapudi commented on MAPREDUCE-2846: ----------------------------------------------- Hi Allen, Can you post how you are configuring mapred.local.dir values? We have not seen this problem in our cluster since we run with Linux task controller. But, Eli is right, we did change Default task controller to make it consistent. Giving more information will help us to understand better like how many disks you have, mapred.local.dir value etc. or even mapred-site.xml. I am asking this information to get an idea of how we can reproduce in our test cluster? > approx 10% of all tasks fail with DefaultTaskController > ------------------------------------------------------- > > Key: MAPREDUCE-2846 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-2846 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: task, task-controller, tasktracker > Affects Versions: 0.20.204.0 > Reporter: Allen Wittenauer > Priority: Blocker > > After upgrading our test 0.20.203 grid to 0.20.204-rc2, we ran terasort to verify operation. While the job completed successfully, approx 10% of the tasks failed with task runner execution errors and the inability to create symlinks for attempt logs. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira