hadoop-yarn-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2459) RM crashes if App gets rejected for any reason and HA is enabled
Date Thu, 11 Sep 2014 11:29:40 GMT

    [ https://issues.apache.org/jira/browse/YARN-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129910#comment-14129910 ]

Hudson commented on YARN-2459:
------------------------------

SUCCESS: Integrated in Hadoop-Yarn-trunk #677 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/677/])
YARN-2459. RM crashes if App gets rejected for any reason and HA is enabled. Contributed by
Jian He (xgong: rev 47bdfa044aa1d587b24edae8b1b0c796d829c960)
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMRestart.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/TestRMAppTransitions.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
* hadoop-yarn-project/CHANGES.txt
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/RMAppManager.java
Fix CHANGES.txt. Credit Mayank Bansal for his contributions on YARN-2459 (xgong: rev 7d38ffc8d3500d428bdad5640e9e70d66ed5ea60)
* hadoop-yarn-project/CHANGES.txt


> RM crashes if App gets rejected for any reason and HA is enabled
> ----------------------------------------------------------------
>
>                 Key: YARN-2459
>                 URL: https://issues.apache.org/jira/browse/YARN-2459
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.4.1
>            Reporter: Mayank Bansal
>            Assignee: Mayank Bansal
>             Fix For: 2.6.0
>
>         Attachments: YARN-2459-1.patch, YARN-2459-2.patch, YARN-2459.3.patch, YARN-2459.4.patch, YARN-2459.5.patch, YARN-2459.6.patch
>
>
> This happens when RM HA is enabled and the ZooKeeper store is used as the RM state store.
> If an app is rejected for any reason, it goes directly from NEW to FAILED.
> The final transition adds the app to the RMApps and completed-apps in-memory structures,
> but the app never makes it to the state store.
> When the default RMApps limit is reached, the RM starts deleting apps from memory and from the store.
> It then tries to delete this app from the store, the delete fails, and the RM crashes.
> Thanks,
> Mayank
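
The failure sequence described above can be illustrated with a small, self-contained Java sketch. This is not the Hadoop code and not the committed patch: RejectedAppSketch, StateStore, AppManager, and every method name are hypothetical, and the sketch assumes the store treats removal of an unknown app as an error. The saveRejectedApp flag exists only to contrast the two cases.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.Set;

// Hypothetical, simplified model of the reported scenario; none of these
// classes or methods are Hadoop APIs.
final class RejectedAppSketch {

    /** Stand-in for a persistent state store (for example, one backed by ZooKeeper). */
    static final class StateStore {
        private final Set<String> saved = new HashSet<>();

        void save(String appId) {
            saved.add(appId);
        }

        // Assumption: removing an app that was never saved is an error.
        void remove(String appId) {
            if (!saved.remove(appId)) {
                throw new IllegalStateException("no stored state for " + appId);
            }
        }
    }

    /** Stand-in for the in-memory bookkeeping of completed apps. */
    static final class AppManager {
        private final StateStore store;
        private final int maxCompletedApps;
        private final Deque<String> completed = new ArrayDeque<>();

        AppManager(StateStore store, int maxCompletedApps) {
            this.store = store;
            this.maxCompletedApps = maxCompletedApps;
        }

        // An accepted app is persisted, then recorded as completed.
        void finishAcceptedApp(String appId) {
            store.save(appId);
            completed.addLast(appId);
            evictOldest();
        }

        // A rejected app jumps straight to its final state. Unless
        // saveRejectedApp is true, it is recorded in memory only.
        void finishRejectedApp(String appId, boolean saveRejectedApp) {
            if (saveRejectedApp) {
                store.save(appId);
            }
            completed.addLast(appId);
            evictOldest();
        }

        // Once the limit is exceeded, drop the oldest apps from memory and the store.
        private void evictOldest() {
            while (completed.size() > maxCompletedApps) {
                store.remove(completed.removeFirst());
            }
        }
    }

    // Returns true if evicting the rejected app fails.
    static boolean evictionCrashes(boolean saveRejectedApp) {
        AppManager manager = new AppManager(new StateStore(), 1);
        try {
            manager.finishRejectedApp("app_1", saveRejectedApp);
            manager.finishAcceptedApp("app_2"); // pushes app_1 past the limit
            return false;
        } catch (IllegalStateException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("rejected app not saved, eviction fails: " + evictionCrashes(false));
        System.out.println("rejected app saved, eviction fails: " + evictionCrashes(true));
    }
}
```

In this model the failure appears only when the rejected app skips the store. In the real ResourceManager the failed delete is reported to be fatal, whereas the sketch merely catches the exception.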



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
