hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Devaraj Das (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-1324) FSError encountered by one running task should not be fatal to other tasks on that node
Date Thu, 03 May 2007 17:24:15 GMT
FSError encountered by one running task should not be fatal to other tasks on that node
---------------------------------------------------------------------------------------

                 Key: HADOOP-1324
                 URL: https://issues.apache.org/jira/browse/HADOOP-1324
             Project: Hadoop
          Issue Type: Improvement
          Components: mapred
            Reporter: Devaraj Das


Currently, if one task encounters a FSError, it reports that to the TaskTracker and the TaskTracker
reinitializes itself and effectively loses state of all the other running tasks too. This
can probably be improved especially after the fix for HADOOP-1252. The TaskTracker should
probably avoid reinitializing itself and instead get blacklisted for that job. Other tasks
should be allowed to continue as long as they can (complete successfully, or, fail either
due to disk problems or otherwise).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message