hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vinod K V (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-5280) When expiring a lost launched task, JT doesn't remove the attempt from the taskidToTIPMap.
Date Wed, 18 Feb 2009 12:37:02 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-5280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674587#action_12674587
] 

Vinod K V commented on HADOOP-5280:
-----------------------------------

On one of the clusters, a map attempt was expired as a lost task in ExpireLaunchingTasks thread,
but it was not removed from taskidToTIPMap. All the reducers were informed that the map has
failed. In the next heartbeat the TT came back reporting the attempt as a success, thereby
preventing launch of any new map attempts for this task. 
Subsequently, all the reduces just got stalled waiting for the output from this map task and
the whole job got stock with no progress. 

> When expiring a lost launched task, JT doesn't remove the attempt from the taskidToTIPMap.
> ------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5280
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5280
>             Project: Hadoop Core
>          Issue Type: Bug
>            Reporter: Vinod K V
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message