hadoop-yarn-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-4101) RM should print alert messages if Zookeeper and Resourcemanager gets connection issue
Date Thu, 03 Sep 2015 05:27:46 GMT

    [ https://issues.apache.org/jira/browse/YARN-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728526#comment-14728526 ]

Hudson commented on YARN-4101:
------------------------------

SUCCESS: Integrated in Hadoop-Mapreduce-trunk #2287 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2287/])
YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection
issue. Contributed by Xuan Gong (jianhe: rev 09c64ba1ba8be7a2ac31f4e42efb8c99b682399f)
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebServices.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/EmbeddedElectorService.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/AboutBlock.java
* hadoop-yarn-project/CHANGES.txt
* hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ha/ActiveStandbyElector.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/RMWebApp.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/RMWebAppFilter.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/AdminService.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/dao/ClusterInfo.java


> RM should print alert messages if Zookeeper and Resourcemanager gets connection issue
> -------------------------------------------------------------------------------------
>
>                 Key: YARN-4101
>                 URL: https://issues.apache.org/jira/browse/YARN-4101
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: yarn
>            Reporter: Yesha Vora
>            Assignee: Xuan Gong
>            Priority: Critical
>             Fix For: 2.8.0, 2.7.2, 2.6.2
>
>         Attachments: YARN-4101.1.patch, YARN-4101.2.patch, YARN-4101.3.patch
>
>
> Currently, there is no way for a user to tell that the connection between ZooKeeper (Zk)
> and the RM has problems. In an HA environment, the RM is highly dependent on ZooKeeper.
> If the connection between the RM and Zk is jeopardized, the cluster is likely to end up
> in a bad state.
> Example: Rm1 is active and Rm2 is standby. If the connection between Rm2 and Zk is lost,
> Rm2 will never become active. In this case, if Rm1 hits an error and cannot be started,
> the cluster goes into a bad state. This situation is very hard for a user to debug. With
> better alert messages, the user could fix the Zk-RM connection issue and avoid getting
> into the bad state.
> Thus, we need a better way to alert the user when the connection Zk -> active RM or
> Zk -> standby RM goes bad.
> Here are the suggestions:
> 1) Print a connection-lost alert in the RM UI.
> 2) Print alert messages when running any YARN command, such as yarn logs, yarn application,
> etc.
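
The two suggestions above amount to tracking each RM's last known ZooKeeper connection state and turning it into a message that a UI page or a command-line tool can display. The following is only a minimal sketch of that idea and is not taken from the YARN-4101 patch: the class ZkConnectionAlert, its State enum, and the message format are all hypothetical, and the states are a simplification rather than ZooKeeper's own event types.

```java
import java.util.Optional;

/**
 * Hypothetical sketch: remembers the last known ZooKeeper connection state
 * of one RM and produces an alert string while the connection is down.
 * None of these names come from the actual YARN-4101 change.
 */
final class ZkConnectionAlert {

    /** Simplified connection states (an assumption, not ZooKeeper's enum). */
    enum State { CONNECTED, DISCONNECTED, EXPIRED }

    private final String rmId;
    private State state = State.CONNECTED;
    private long lastChangeMillis;

    ZkConnectionAlert(String rmId, long nowMillis) {
        this.rmId = rmId;
        this.lastChangeMillis = nowMillis;
    }

    /** Records a state transition; repeated reports of the same state keep the original timestamp. */
    synchronized void onStateChange(State newState, long nowMillis) {
        if (newState != state) {
            state = newState;
            lastChangeMillis = nowMillis;
        }
    }

    /** Returns an alert for a UI or CLI to print, or empty while connected. */
    synchronized Optional<String> alertMessage(long nowMillis) {
        if (state == State.CONNECTED) {
            return Optional.empty();
        }
        long seconds = (nowMillis - lastChangeMillis) / 1000;
        return Optional.of(String.format(
            "ALERT: %s has had no ZooKeeper connection for %d s (state=%s)",
            rmId, seconds, state));
    }
}
```

In this sketch a caller would invoke onStateChange from whatever callback reports connection events and call alertMessage when rendering a status page or before printing command output. Passing the clock value in as a parameter is a design choice made here only to keep the example deterministic.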



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
