hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wangda Tan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-5015) Support sliding window retry capability for container restart
Date Thu, 08 Mar 2018 02:41:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16390626#comment-16390626
] 

Wangda Tan commented on YARN-5015:
----------------------------------

[~csingh], could you explain a bit about how this logic will be shared by RM and AM? Per my
understanding, restart AM container should be handled by NM, correct? Did you mean AM needs
to implement similar logic to restart its container? If so, why not directly leverage NM logics
to handle container auto restart?

bq. The default value of remainingRetries is -1, that is, when it is not set, it is -1.
How about set initial remainingRetries directly to maxRetries? Which can avoid such check

> Support sliding window retry capability for container restart 
> --------------------------------------------------------------
>
>                 Key: YARN-5015
>                 URL: https://issues.apache.org/jira/browse/YARN-5015
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>            Reporter: Varun Vasudev
>            Assignee: Chandni Singh
>            Priority: Major
>              Labels: oct16-medium
>         Attachments: YARN-5015.01.patch, YARN-5015.02.patch, YARN-5015.03.patch
>
>
> We support sliding window retry policy for AM restarts (Introduced in YARN-611). Similar
sliding window retry policy is needed for container restarts.
> With this change, we can introduce a common class for SlidingWindowRetryPolicy ( suggested
by [~vvasudev] in the comments) and integrate it to container restart. 
> In a subsequent jira, we can modify the AM code to use SlidingWindowRetryPolicy which
will unify the AM and container restart code.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message