hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jian He (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-4740) container complete msg may lost while AM restart in race condition
Date Thu, 03 Mar 2016 21:32:18 GMT

    [ https://issues.apache.org/jira/browse/YARN-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15178657#comment-15178657
] 

Jian He commented on YARN-4740:
-------------------------------

[~sandflee], actually, with this fix, the 2nd AM may possibly receive duplicated container
statuses, while the 1st AM has already received it ?

> container complete msg may lost while AM restart in race condition
> ------------------------------------------------------------------
>
>                 Key: YARN-4740
>                 URL: https://issues.apache.org/jira/browse/YARN-4740
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: sandflee
>            Assignee: sandflee
>         Attachments: YARN-4740.01.patch, YARN-4740.02.patch
>
>
> 1, container completed, and the msg is store in RMAppAttempt.justFinishedContainers
> 2,  AM allocate and before allocateResponse came to AM, AM crashed
> 3,  AM restart and couldn't get the container complete msg.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message