hadoop-mapreduce-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (MAPREDUCE-6776) yarn.app.mapreduce.client.job.max-retries should have a more useful default
Date Fri, 07 Oct 2016 22:00:22 GMT

    [ https://issues.apache.org/jira/browse/MAPREDUCE-6776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15556364#comment-15556364 ]

Hudson commented on MAPREDUCE-6776:
-----------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #10570 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/10570/])
MAPREDUCE-6776. yarn.app.mapreduce.client.job.max-retries should have a (rkanter: rev f3f37e6fb8172f6434e06eb9a137c0c155b3952e)
* (edit) hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/mapred-default.xml
* (edit) hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java
* (edit) hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/JobClientUnitTest.java


> yarn.app.mapreduce.client.job.max-retries should have a more useful default
> ---------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-6776
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6776
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: client
>    Affects Versions: 2.8.0
>            Reporter: Daniel Templeton
>            Assignee: Miklos Szegedi
>             Fix For: 2.9.0, 3.0.0-alpha2
>
>         Attachments: MAPREDUCE-6776.001.patch, MAPREDUCE-6776.002.patch, MAPREDUCE-6776.003.patch
>
>
> The default is 0, so any communication failure results in a client failure.  Oozie doesn't
> like that.  If the RM is failing over and Oozie gets a communication failure, it assumes the
> target job has failed.  I propose raising the default to something modest like 3 or 5.  The
> default retry interval is 2s.
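
To illustrate why a default of 0 is fragile, here is a minimal Java sketch of a bounded retry loop with a fixed interval. It is not the Hadoop client code: the class RetrySketch, the helper callWithRetries, and its parameters are hypothetical names chosen for this example, and the simulated failure stands in for an RM failover. With maxRetries = 0 the first communication failure is thrown straight to the caller, which is the behavior the issue describes.

```java
import java.util.concurrent.Callable;

// Hypothetical illustration only; not taken from the Hadoop source tree.
class RetrySketch {

    // Runs `call` once, then retries up to `maxRetries` more times,
    // sleeping `retryIntervalMs` between attempts. Rethrows the last failure.
    static <T> T callWithRetries(Callable<T> call, int maxRetries, long retryIntervalMs)
            throws Exception {
        if (maxRetries < 0) {
            throw new IllegalArgumentException("maxRetries must be >= 0");
        }
        Exception last = null;
        for (int attempt = 0; attempt <= maxRetries; attempt++) {
            try {
                return call.call();
            } catch (Exception e) {
                last = e;
                // Only wait if another attempt is still allowed.
                if (attempt < maxRetries) {
                    Thread.sleep(retryIntervalMs);
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) throws Exception {
        int[] calls = {0};
        // Simulated status query that fails twice (e.g. during a failover), then succeeds.
        Callable<String> flaky = () -> {
            if (++calls[0] <= 2) {
                throw new java.io.IOException("simulated communication failure");
            }
            return "RUNNING";
        };
        // 3 retries with a short 10 ms interval so the demo finishes quickly.
        System.out.println(callWithRetries(flaky, 3, 10L)); // prints RUNNING
    }
}
```

With the same flaky call and maxRetries set to 0, callWithRetries would throw the first IOException, and a caller such as Oozie would see that as a failed job even though the job itself is still running.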



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-help@hadoop.apache.org

