hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Allen Wittenauer (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-2846) approx 10% of all tasks fail with DefaultTaskController
Date Tue, 16 Aug 2011 18:15:31 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13085880#comment-13085880
] 

Allen Wittenauer commented on MAPREDUCE-2846:
---------------------------------------------

*nods*  I'm mostly convinced it is a race condition in MR-2415.  I haven't had enough time
to start playing in the source to track it down more.  I did talk to Owen about already, but
thought it might be useful to at least get the JIRA filed to put more eyes on it since race
conditions are usually pretty awful to track down.

> approx 10% of all tasks fail with DefaultTaskController
> -------------------------------------------------------
>
>                 Key: MAPREDUCE-2846
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2846
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: task, task-controller, tasktracker
>    Affects Versions: 0.20.204.0
>            Reporter: Allen Wittenauer
>            Priority: Blocker
>
> After upgrading our test 0.20.203 grid to 0.20.204-rc2, we ran terasort to verify operation.
 While the job completed successfully, approx 10% of the tasks failed with task runner execution
errors and the inability to create symlinks for attempt logs.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message