hadoop-yarn-issues mailing list archives

From "Jian He (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2630) TestDistributedShell#testDSRestartWithPreviousRunningContainers fails
Date Wed, 01 Oct 2014 05:50:34 GMT

    [ https://issues.apache.org/jira/browse/YARN-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14154395#comment-14154395
] 

Jian He commented on YARN-2630:
-------------------------------

bq. Is it correct to only notify NM when keepContainersAcrossApplicationAttempts is set? 
I added this check because in a work-preserving AM restart, the second AM needs to know about the previous
AM's finished containers. So we should not prematurely make the NM remove the containers, in
case the RM restarted.

> TestDistributedShell#testDSRestartWithPreviousRunningContainers fails
> ---------------------------------------------------------------------
>
>                 Key: YARN-2630
>                 URL: https://issues.apache.org/jira/browse/YARN-2630
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Jian He
>            Assignee: Jian He
>         Attachments: YARN-2630.1.patch, YARN-2630.2.patch
>
>
> The problem is that after YARN-1372, in a work-preserving AM restart, the re-launched AM
will also receive the previously failed AM container. But the DistributedShell logic does not expect
this extra completed container.
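
One way an application master could tolerate such an unexpected completed container is to track the container IDs it actually expects and ignore completions for anything else. The following is only a minimal sketch of that idea, not the change made in the attached patches. The class `CompletedContainerFilter` and its methods are hypothetical names, plain strings stand in for YARN container IDs, and no YARN API is used.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration; not part of DistributedShell or the YARN API.
final class CompletedContainerFilter {
    // IDs of containers this AM attempt launched or took over as still running.
    private final Set<String> expectedIds = new HashSet<>();

    // Register a container whose completion this attempt should count.
    void expect(String containerId) {
        expectedIds.add(containerId);
    }

    // Return only the completed IDs that were expected, and stop tracking them.
    // Completions for unknown IDs (for example, a previous attempt's AM
    // container) are silently skipped.
    List<String> accept(List<String> completedIds) {
        List<String> accepted = new ArrayList<>();
        for (String id : completedIds) {
            if (expectedIds.remove(id)) {
                accepted.add(id);
            }
        }
        return accepted;
    }
}
```

With this sketch, a completion report for the failed AM container from the previous attempt would be dropped instead of being counted toward the application's completed-container total, assuming the new attempt never registered that ID.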



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
