hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jian He (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-4770) Auto-restart of containers should work across NM restarts.
Date Fri, 18 Nov 2016 00:52:58 GMT

    [ https://issues.apache.org/jira/browse/YARN-4770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15675321#comment-15675321

Jian He commented on YARN-4770:

bq. If container crashed during the NM reboot, container would transit to RELAUNCHING state.
I will check it again.
Is this working now ? if so, we can close this.

> Auto-restart of containers should work across NM restarts.
> ----------------------------------------------------------
>                 Key: YARN-4770
>                 URL: https://issues.apache.org/jira/browse/YARN-4770
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Vinod Kumar Vavilapalli
> See my comment [here|https://issues.apache.org/jira/browse/YARN-3998?focusedCommentId=15133367&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15133367]
on YARN-3998. Need to take care of two things:
>  - The relaunch feature needs to work across NM restarts, so we should save the retry-context
and policy per container into the state-store and reload it for continue relaunching after
NM restart.
>  - We should also handle restarting of any containers that may have crashed during the
NM reboot.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

View raw message