hadoop-mapreduce-dev mailing list archives

From "Siddharth Seth (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-2504) MR 279: race in JobHistoryEventHandler stop
Date Tue, 17 May 2011 16:35:47 GMT
MR 279: race in JobHistoryEventHandler stop 
--------------------------------------------

                 Key: MAPREDUCE-2504
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2504
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mrv2
            Reporter: Siddharth Seth
            Assignee: Siddharth Seth
             Fix For: 0.23.0


The eventHandling thread currently exits only when it is both 'stopped' AND interrupted.
If an interrupt arrives after a take, but before handleEvent is called, the interrupt status
ends up being consumed by hadoop.util.Shell.runCommand(), which ignores it (and in the process
resets the flag).
The loop therefore never sees the interrupt, and the eventHandling thread hangs on the next eventQueue.take().
This currently causes random unit test failures and can hang MR AMs.
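
The following is a minimal sketch of one way to avoid the hang. It is not the actual JobHistoryEventHandler code or the committed patch. The class EventLoopSketch, its methods, and the sleeping handler that stands in for Shell.runCommand() swallowing the interrupt are all hypothetical. The idea shown is that the loop exits on the 'stopped' flag alone and uses the interrupt only to wake a blocked take().

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical stand-in for an event-handling loop; not Hadoop code.
class EventLoopSketch {
    private final BlockingQueue<String> eventQueue = new LinkedBlockingQueue<>();
    private final CountDownLatch handling = new CountDownLatch(1);
    private volatile boolean stopped = false;
    private final Thread eventHandlingThread =
        new Thread(this::runLoop, "eventHandling");

    private void runLoop() {
        // Exit on the flag alone. Requiring the interrupt status as well
        // is what hangs when a callee has already cleared that status.
        while (!stopped) {
            String event;
            try {
                event = eventQueue.take();
            } catch (InterruptedException e) {
                continue; // woken by stop(); the loop condition decides
            }
            handleEvent(event);
        }
    }

    // Simulates a handler that swallows the interrupt, as the report
    // describes for Shell.runCommand(). The long sleep is only there so
    // that stop() can be called while an event is being handled.
    private void handleEvent(String event) {
        handling.countDown();
        try {
            Thread.sleep(60_000);
        } catch (InterruptedException ignored) {
            // The interrupt status is now cleared and the caller never sees it.
        }
    }

    void start() { eventHandlingThread.start(); }

    void put(String event) { eventQueue.add(event); }

    // Blocks until the handler has been entered at least once.
    void awaitHandling() {
        try {
            handling.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    void stop() {
        stopped = true;                  // set the flag first
        eventHandlingThread.interrupt(); // then wake a blocked take()
        try {
            eventHandlingThread.join(5_000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    boolean isAlive() { return eventHandlingThread.isAlive(); }

    public static void main(String[] args) {
        EventLoopSketch loop = new EventLoopSketch();
        loop.start();
        loop.put("event");
        loop.awaitHandling(); // the handler is now running
        loop.stop();          // the interrupt lands inside the handler
        System.out.println("thread alive after stop: " + loop.isAlive());
    }
}
```

In this sketch stop() sets the flag before interrupting, so whichever call consumes the interrupt, the loop sees stopped == true on its next check. If the interrupt is still pending when take() is called, take() throws InterruptedException immediately instead of blocking.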

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
