hadoop-yarn-issues mailing list archives

From "Vinod Kumar Vavilapalli (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3518) default rm/am expire interval should not less than default resourcemanager connect wait time
Date Mon, 04 May 2015 19:18:07 GMT

    [ https://issues.apache.org/jira/browse/YARN-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14527100#comment-14527100 ]

Vinod Kumar Vavilapalli commented on YARN-3518:
-----------------------------------------------

We need to be careful here. Clients from gateway machines should be treated separately from
AMs - a distinction we don't have today. It actually makes sense for clients to retry for
a longer time than is usual for AMs.

> default rm/am expire interval should not less than default resourcemanager connect wait time
> --------------------------------------------------------------------------------------------
>
>                 Key: YARN-3518
>                 URL: https://issues.apache.org/jira/browse/YARN-3518
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager, resourcemanager
>            Reporter: sandflee
>            Assignee: sandflee
>              Labels: configuration, newbie
>         Attachments: YARN-3518.001.patch
>
>
> Take the AM as an example: if the AM can't connect to the RM, then after the AM expiry
> interval (600 s) the RM relaunches the AM, and two AMs will run at the same time until the
> resourcemanager connect max wait time (900 s) has passed.
> DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS =  15 * 60 * 1000;
> DEFAULT_RM_AM_EXPIRY_INTERVAL_MS = 600000;
> DEFAULT_RM_NM_EXPIRY_INTERVAL_MS = 600000;
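
The arithmetic behind the quoted description can be sketched as follows. This is only an illustration: the two constant values are copied from the description above, while the class name ExpiryOverlap and the helper overlapWindowMs are hypothetical and not part of YARN.

```java
/** Sketch: how long two AMs could coexist under the defaults quoted above. */
final class ExpiryOverlap {
    // Values copied from the quoted issue description.
    static final long DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS = 15 * 60 * 1000;
    static final long DEFAULT_RM_AM_EXPIRY_INTERVAL_MS = 600000;

    /**
     * Hypothetical helper: milliseconds during which the old AM may still be
     * retrying its RM connection after the RM has already expired it.
     * Returns 0 when the connect wait does not exceed the expiry interval.
     */
    static long overlapWindowMs(long connectMaxWaitMs, long amExpiryMs) {
        return Math.max(0L, connectMaxWaitMs - amExpiryMs);
    }

    public static void main(String[] args) {
        long overlap = overlapWindowMs(
                DEFAULT_RESOURCEMANAGER_CONNECT_MAX_WAIT_MS,
                DEFAULT_RM_AM_EXPIRY_INTERVAL_MS);
        // 900 s connect wait minus 600 s expiry leaves a 300 s window.
        System.out.println("Possible overlap: " + overlap / 1000 + " s");
    }
}
```

With the quoted defaults the window is 300 s; it drops to zero once the connect max wait time is no longer than the expiry interval, which is the condition the issue title asks for.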



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
