hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3387) Previous AM's container complete message couldn't pass to current am if am restarted and rm changed
Date Sat, 25 Apr 2015 11:35:50 GMT

    [ https://issues.apache.org/jira/browse/YARN-3387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512450#comment-14512450
] 

Hudson commented on YARN-3387:
------------------------------

FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #165 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/165/])
YARN-3387. Previous AM's container completed status couldn't pass to current AM if AM and
RM restarted during the same time. Contributed by Sandflee (jianhe: rev d03dcb9635dbd79a45d229d1cab5fd28e5e49f49)
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestWorkPreservingRMRestart.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/attempt/RMAppAttemptImpl.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
* hadoop-yarn-project/CHANGES.txt


> Previous AM's container complete message couldn't pass to current am if am restarted
and rm changed
> ---------------------------------------------------------------------------------------------------
>
>                 Key: YARN-3387
>                 URL: https://issues.apache.org/jira/browse/YARN-3387
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.6.0
>            Reporter: sandflee
>            Assignee: sandflee
>            Priority: Critical
>              Labels: patch
>             Fix For: 2.8.0
>
>         Attachments: YARN-3387.001.patch, YARN-3387.002.patch
>
>
> suppose am work preserving and rm ha is enabled.
> container complete message is passed to appattemt.justFinishedContainers in rm。in normal
situation,all attempt in one app shares the same justFinishedContainers, but when rm changed,
every attempt has it's own justFinishedContainers, so in situations below, container complete
message couldn't passed to am:
> 1, am restart
> 2, rm changes
> 3, container launched by first am completes
> container complete message will be passed to appAttempt1 not appAttempt2, but am pull
finished containers from appAttempt2 (currentAppAttempt)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message