hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ramkrishna.s.vasudevan (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-4479) TestMasterFailover failure in Hbase-0.92#17
Date Mon, 03 Oct 2011 11:23:33 GMT

    [ https://issues.apache.org/jira/browse/HBASE-4479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119246#comment-13119246
] 

ramkrishna.s.vasudevan commented on HBASE-4479:
-----------------------------------------------

{code}
2011-10-01 06:13:12,255 ERROR [main-EventThread] master.HMaster(1275): Primary master encountered
unexpected exception while trying to recover from ZooKeeper session expiry. Proceeding with
server abort.
java.lang.IllegalArgumentException: n must be positive
	at java.util.Random.nextInt(Random.java:250)
	at org.apache.hadoop.hbase.master.DefaultLoadBalancer.retainAssignment(DefaultLoadBalancer.java:590)
	at org.apache.hadoop.hbase.master.AssignmentManager.assignAllUserRegions(AssignmentManager.java:1880)
	at org.apache.hadoop.hbase.master.AssignmentManager.processRegionsInTransition(AssignmentManager.java:385)
	at org.apache.hadoop.hbase.master.AssignmentManager.processRegionsInTransition(AssignmentManager.java:338)
	at org.apache.hadoop.hbase.master.HMaster.tryRecoveringExpiredZKSession(HMaster.java:1250)
	at org.apache.hadoop.hbase.master.HMaster.abortNow(HMaster.java:1273)
	at org.apache.hadoop.hbase.master.HMaster.abort(HMaster.java:1212)
	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.connectionEvent(ZooKeeperWatcher.java:324)
	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:242)
	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530)
	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506)
2011-10-01 06:13:12,255 FATAL [main-EventThread] master.HMaster(1213): master:42546-0x132be1ce3a00009
master:42546-0x132be1ce3a00009 received expired from ZooKeeper, aborting
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired
	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.connectionEvent(ZooKeeperWatcher.java:324)
	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:242)
	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530)
	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506)
2011-10-01 06:13:12,256 INFO  [main-EventThread] master.HMaster(1343): Aborting

{code}
                
> TestMasterFailover failure in Hbase-0.92#17
> -------------------------------------------
>
>                 Key: HBASE-4479
>                 URL: https://issues.apache.org/jira/browse/HBASE-4479
>             Project: HBase
>          Issue Type: Bug
>            Reporter: ramkrishna.s.vasudevan
>            Assignee: ramkrishna.s.vasudevan
>            Priority: Minor
>
> When the master restarted it was not able to get any servers online and the restart was
a clean restart.
> Hence there were no regions to assign.
> Hence the retainAssignment tries to get one of the regions and uses RANDOM.getInt(size).
 Here size is 0.
> So ideally 0 is not accepted here.  Hence we have got an exception making the master
not to come up and the test case timeout.
> Though we need to see if really no regions was expected when the master came up, but
this JIRA's intent is to deal such scenario where the size can be 0.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message