hadoop-common-dev mailing list archives

From "ZhuGuanyin (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-5407) Sometimes, Reduce tasks hang, State is unassigned
Date Thu, 05 Mar 2009 06:37:56 GMT
Sometimes, Reduce tasks hang, State is unassigned
-------------------------------------------------

                 Key: HADOOP-5407
                 URL: https://issues.apache.org/jira/browse/HADOOP-5407
             Project: Hadoop Core
          Issue Type: Bug
          Components: mapred
    Affects Versions: 0.19.0
            Reporter: ZhuGuanyin


Hi, all

After our cluster has been running for a long time, some reduce tasks on certain tasktrackers
hang. Their state is UNASSIGNED. From then on, every reduce task on those tasktrackers hangs.

If we kill a hung reduce task, the new attempt is rescheduled to the same tasktracker and
hangs again. If we fail it instead, the attempt goes to another tasktracker and completes
successfully.

A tasktracker that has a hung reduce task still receives new reduce tasks, but they also
hang indefinitely.

After we reboot the tasktracker machine, reduce tasks no longer hang.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

