hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amareshwari Sriramadasu (JIRA)" <j...@apache.org>
Subject [jira] Created: (MAPREDUCE-1895) MapEventFetcherThread should not iterate over jobs that are not localized
Date Fri, 25 Jun 2010 05:51:50 GMT
MapEventFetcherThread should not iterate over jobs that are not localized
-------------------------------------------------------------------------

                 Key: MAPREDUCE-1895
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1895
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: tasktracker
            Reporter: Amareshwari Sriramadasu


We have seen a scenario of lost trackers on our clusters because of the following:
TaskLauncher has locked a TaskTracker$RunningJob and doing localizeJob, which involves DFS
operations. Map-event
fetcher has locked TaskTracker.runningJobs map and is waiting to lock the RunningJob object.
TaskTracker offerService
is waiting to lock TaskTracker.runningJobs map, thus failing to send heartbeats in 10 minutes.


So, I think map-event fetcher should circuit jobs that are not localized.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message