hadoop-common-dev mailing list archives

From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (HADOOP-569) Hadoop should allow the user to dynamically change the number of times to re-try failed tasks before declaring the job fail
Date Thu, 24 May 2007 06:57:17 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley resolved HADOOP-569.
----------------------------------

       Resolution: Duplicate
    Fix Version/s: 0.13.0
         Assignee:     (was: Owen O'Malley)

This was fixed by HADOOP-1304. The number of failures/task is configurable, but not dynamic.
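
To illustrate what "configurable, but not dynamic" means in practice, here is a minimal, self-contained sketch. It is not Hadoop code: the class `RetryLimitSketch`, the method `runTask`, and the parameter `maxAttempts` are hypothetical names chosen for illustration, and the actual setting introduced by HADOOP-1304 is not named in this message. The sketch assumes the attempt limit is read once when the job is set up and cannot be raised afterwards.

```java
import java.util.function.IntPredicate;

public class RetryLimitSketch {
    /** Outcome of running one task under a fixed attempt limit. */
    public enum Outcome { SUCCEEDED, JOB_FAILED }

    /**
     * Runs a task up to maxAttempts times (hypothetical model, not Hadoop's API).
     * The limit is passed in once, before the task starts, and there is no way
     * to raise it while the loop is running: configurable, but not dynamic.
     */
    public static Outcome runTask(int maxAttempts, IntPredicate attemptSucceeds) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            if (attemptSucceeds.test(attempt)) {
                return Outcome.SUCCEEDED;
            }
        }
        // Every allowed attempt failed, so the whole job is declared failed.
        return Outcome.JOB_FAILED;
    }

    public static void main(String[] args) {
        // Hypothetical flaky task that only succeeds on its 5th attempt.
        IntPredicate succeedsOnFifth = attempt -> attempt >= 5;

        System.out.println(runTask(4, succeedsOnFifth)); // prints JOB_FAILED
        System.out.println(runTask(6, succeedsOnFifth)); // prints SUCCEEDED
    }
}
```

Under this model, the reporter's long-running job would only benefit from a higher limit if that limit had been chosen before the job was submitted.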

> Hadoop should allow the user to dynamically change the number of times to re-try failed tasks before declaring the job fail
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-569
>                 URL: https://issues.apache.org/jira/browse/HADOOP-569
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Runping Qi
>             Fix For: 0.13.0
>
>
> Hadoop has a built-in mechanism to fail a job if some tasks failed more than 3 times.
> This mechanism works fine in most scenarios. However, in some other cases, it is highly
> desirable for the user to change (increase) that number. My currently running job
> demonstrates such a scenario: the job has run for more than 2.5 days. It is close to
> completion (90+%). Everything indicates that it will eventually finish within a day,
> except for one potential danger: some of the tasks are in their 3rd try!
> It would be extremely helpful if I could change the maximum number of tries to 6 instead
> of 4!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

