hadoop-mapreduce-issues mailing list archives

From "Bharath Mundlapudi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-2846) approx 10% of all tasks fail with DefaultTaskController
Date Thu, 18 Aug 2011 21:14:28 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13087290#comment-13087290 ]

Bharath Mundlapudi commented on MAPREDUCE-2846:
-----------------------------------------------

Hi Allen, 

Can you post how you are configuring mapred.local.dir? We have not seen this problem
in our cluster since we run with the Linux task controller. But Eli is right, we did change
the Default task controller to make it consistent. More information would help us understand
the problem better: how many disks you have, your mapred.local.dir value, or even your
whole mapred-site.xml. I am asking for this so we can work out how to reproduce the failure
in our test cluster.
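
For illustration, the relevant part of a mapred-site.xml might look like the sketch
below. The directory paths and the number of disks are hypothetical placeholders, not
values from the affected grid; only the property names and the controller class name
are taken from Hadoop 0.20.x.

```xml
<!-- Sketch only: paths are hypothetical, one local directory per physical disk. -->
<configuration>
  <property>
    <name>mapred.local.dir</name>
    <!-- Comma-separated list of local directories for intermediate task data. -->
    <value>/disk1/mapred/local,/disk2/mapred/local,/disk3/mapred/local</value>
  </property>
  <property>
    <name>mapred.task.tracker.task-controller</name>
    <!-- The controller named in this issue; the alternative is LinuxTaskController. -->
    <value>org.apache.hadoop.mapred.DefaultTaskController</value>
  </property>
</configuration>
```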



> approx 10% of all tasks fail with DefaultTaskController
> -------------------------------------------------------
>
>                 Key: MAPREDUCE-2846
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2846
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: task, task-controller, tasktracker
>    Affects Versions: 0.20.204.0
>            Reporter: Allen Wittenauer
>            Priority: Blocker
>
> After upgrading our test 0.20.203 grid to 0.20.204-rc2, we ran terasort to verify operation.
> While the job completed successfully, approx 10% of the tasks failed with task runner execution
> errors and the inability to create symlinks for attempt logs.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
