hadoop-yarn-issues mailing list archives

From "Rohith (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.
Date Tue, 23 Sep 2014 09:32:33 GMT

    [ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14144604#comment-14144604 ]

Rohith commented on YARN-2588:
------------------------------

Consider an RM that is initially in standby:
1. The RM is in Standby state.
2. A Standby --> Active transition fails because the ZK session expired.
3. The RM remains in Standby state.
4. Every subsequent Standby --> Active transition fails with the exception below.
{code}
2014-09-20 14:21:36,980 | WARN  | AsyncDispatcher event handler-EventThread | Exception handling the winning of election | org.apache.hadoop.ha.ActiveStandbyElector.becomeActive(ActiveStandbyElector.java:809)
org.apache.hadoop.ha.ServiceFailedException: RM could not transition to Active
	at org.apache.hadoop.yarn.server.resourcemanager.EmbeddedElectorService.becomeActive(EmbeddedElectorService.java:128)
	at org.apache.hadoop.ha.ActiveStandbyElector.becomeActive(ActiveStandbyElector.java:805)
	at org.apache.hadoop.ha.ActiveStandbyElector.processResult(ActiveStandbyElector.java:416)
	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:602)
	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
Caused by: org.apache.hadoop.ha.ServiceFailedException: Error when transitioning to Active mode
	at org.apache.hadoop.yarn.server.resourcemanager.AdminService.transitionToActive(AdminService.java:290)
	at org.apache.hadoop.yarn.server.resourcemanager.EmbeddedElectorService.becomeActive(EmbeddedElectorService.java:126)
	... 4 more
Caused by: org.apache.hadoop.service.ServiceStateException: RMActiveServices cannot enter state STARTED from state STOPPED
	at org.apache.hadoop.service.ServiceStateModel.checkStateTransition(ServiceStateModel.java:129)
	at org.apache.hadoop.service.ServiceStateModel.enterState(ServiceStateModel.java:111)
{code}
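
The last "Caused by" in the trace suggests why the retry fails: the active services were stopped after the first failed attempt, and a stopped service cannot be started again. The following is a minimal, self-contained sketch of that behaviour, not the actual RM code. The classes LifecycleSketch, ActiveServices and ResourceManagerSketch are hypothetical stand-ins, the transition table is a simplified model of a service lifecycle, and recreating the services after a failure is shown only as one plausible remedy, not necessarily the fix adopted for this issue.

```java
import java.util.EnumMap;
import java.util.EnumSet;
import java.util.Map;

public class LifecycleSketch {

    /** Lifecycle states, named after the states that appear in the stack trace. */
    enum State { NOTINITED, INITED, STARTED, STOPPED }

    /** Simplified transition table (an assumption): STOPPED is terminal. */
    static final Map<State, EnumSet<State>> ALLOWED = new EnumMap<>(State.class);
    static {
        ALLOWED.put(State.NOTINITED, EnumSet.of(State.INITED, State.STOPPED));
        ALLOWED.put(State.INITED, EnumSet.of(State.STARTED, State.STOPPED));
        ALLOWED.put(State.STARTED, EnumSet.of(State.STOPPED));
        ALLOWED.put(State.STOPPED, EnumSet.noneOf(State.class));
    }

    /** Hypothetical stand-in for RMActiveServices. */
    static class ActiveServices {
        private State state = State.NOTINITED;

        State getState() { return state; }

        void enter(State next) {
            if (!ALLOWED.get(state).contains(next)) {
                throw new IllegalStateException(
                    "ActiveServices cannot enter state " + next + " from state " + state);
            }
            state = next;
        }
    }

    /** Hypothetical stand-in for the RM's transition logic. */
    static class ResourceManagerSketch {
        private final boolean recreateOnFailure;
        private ActiveServices active = newInitedServices();

        ResourceManagerSketch(boolean recreateOnFailure) {
            this.recreateOnFailure = recreateOnFailure;
        }

        static ActiveServices newInitedServices() {
            ActiveServices s = new ActiveServices();
            s.enter(State.INITED);
            return s;
        }

        /** Returns true if the RM became active; zkFails simulates a ZK error. */
        boolean transitionToActive(boolean zkFails) {
            try {
                active.enter(State.STARTED);
                if (zkFails) {
                    throw new RuntimeException("simulated ZK session expiry");
                }
                return true;
            } catch (RuntimeException e) {
                // Clean up after the failure: stop the services if still running.
                if (active.getState() != State.STOPPED) {
                    active.enter(State.STOPPED);
                }
                // Assumed remedy: replace the stopped instance with a fresh one.
                if (recreateOnFailure) {
                    active = newInitedServices();
                }
                return false;
            }
        }
    }

    public static void main(String[] args) {
        ResourceManagerSketch buggy = new ResourceManagerSketch(false);
        System.out.println("no recreate, attempt 1 (ZK fails): " + buggy.transitionToActive(true));
        System.out.println("no recreate, attempt 2 (ZK ok):    " + buggy.transitionToActive(false));

        ResourceManagerSketch fixed = new ResourceManagerSketch(true);
        System.out.println("recreate, attempt 1 (ZK fails):    " + fixed.transitionToActive(true));
        System.out.println("recreate, attempt 2 (ZK ok):       " + fixed.transitionToActive(false));
    }
}
```

In this sketch, the instance that keeps its stopped services fails the second attempt even though ZK is healthy again, which mirrors steps 2 to 4 above, while the instance that recreates its services succeeds on retry.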


> Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.
> --------------------------------------------------------------------------------------------------
>
>                 Key: YARN-2588
>                 URL: https://issues.apache.org/jira/browse/YARN-2588
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 3.0.0, 2.6.0, 2.5.1
>            Reporter: Rohith
>            Assignee: Rohith
>
> Consider a scenario where the Standby RM fails to transition to Active because of a ZK exception (ConnectionLoss or SessionExpired). Any further attempt to transition the same RM to Active then fails to move it to the Active state.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
