hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "sandflee (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3161) Containers' information are lost in some cases when RM restart
Date Tue, 10 Feb 2015 02:19:34 GMT

    [ https://issues.apache.org/jira/browse/YARN-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313383#comment-14313383
] 

sandflee commented on YARN-3161:
--------------------------------

if the NM machine crashes while RM restart, it seems we'll lost the container info forever

> Containers' information are lost in some cases when RM restart
> --------------------------------------------------------------
>
>                 Key: YARN-3161
>                 URL: https://issues.apache.org/jira/browse/YARN-3161
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.6.0
>            Reporter: Jun Gong
>
> When RM restart, containers' information will be lost for the following scenarios:
> 1. NM restarts before it sends containers' information to the new active RM. 
> 2. NM stops and it could not send containers' information to the new active RM.
> Without those containers' information, corresponding AM will never get their status through
RM, and AM would just wait them for ever.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message