hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-185) tasks are lost during pollForNewTask
Date Tue, 02 May 2006 04:01:47 GMT
     [ http://issues.apache.org/jira/browse/HADOOP-185?page=all ]

Owen O'Malley updated HADOOP-185:
---------------------------------

    Attachment: launch-time-out.patch

This patch adds a new thread to the JobTracker that every three minutes checks the list of
launched tasks that haven't been reported yet. If any of them are older than 10 minutes, they
are marked as failed.

> tasks are lost during pollForNewTask
> ------------------------------------
>
>          Key: HADOOP-185
>          URL: http://issues.apache.org/jira/browse/HADOOP-185
>      Project: Hadoop
>         Type: Bug

>   Components: mapred
>     Versions: 0.1.1
>     Reporter: Owen O'Malley
>     Assignee: Owen O'Malley
>     Priority: Critical
>      Fix For: 0.2
>  Attachments: launch-time-out.patch
>
> There is the potential for "losing" tasks that are assigned by the JobTracker to a TaskTracker,
but that fail during returning the result (usually due to a RPC timeout). In this case, the
Job becomes "wedged" in that the tasks will never run and never time out.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


Mime
View raw message