helix-commits mailing list archives

From "Ming Fang (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HELIX-96) ZkBaseDataAccessor.get() hangs during Zookeeper failure
Date Thu, 16 May 2013 18:07:15 GMT
Ming Fang created HELIX-96:

             Summary: ZkBaseDataAccessor.get() hangs during Zookeeper failure
                 Key: HELIX-96
                 URL: https://issues.apache.org/jira/browse/HELIX-96
             Project: Apache Helix
          Issue Type: Bug
          Components: helix-core
    Affects Versions: 0.6.0-incubating
            Reporter: Ming Fang

During our failure testing with Zookeeper running in standalone mode, we sometimes see
our application hang with the call stack below:

   java.lang.Thread.State: TIMED_WAITING (parking)
	at sun.misc.Unsafe.park(Native Method)
	- parking to wait for  <0x187c1f10> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
	at java.util.concurrent.locks.LockSupport.parkUntil(LockSupport.java:237)
	at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitUntil(AbstractQueuedSynchronizer.java:2072)
	at org.I0Itec.zkclient.ZkClient.waitForKeeperState(ZkClient.java:636)
	at org.I0Itec.zkclient.ZkClient.waitUntilConnected(ZkClient.java:619)
	at org.I0Itec.zkclient.ZkClient.waitUntilConnected(ZkClient.java:615)
	at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:679)
	at org.apache.helix.manager.zk.ZkClient.readData(ZkClient.java:254)
	at org.I0Itec.zkclient.ZkClient.readData(ZkClient.java:761)
	at org.apache.helix.manager.zk.ZkBaseDataAccessor.get(ZkBaseDataAccessor.java:315)
	at org.apache.helix.manager.zk.ZkCacheBaseDataAccessor.get(ZkCacheBaseDataAccessor.java:461)

The comment in ZkClient.java at line 677 seems to say that the client would eventually get a
Disconnected event and then throw an exception, but we waited for many minutes and the call
did not return.

We were also able to resume simply by restarting Zookeeper.
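
As a possible caller-side mitigation while the underlying wait is unbounded, a blocking read can be run on a worker thread and given a deadline. The sketch below uses only JDK classes. BoundedRead and callWithTimeout are hypothetical names, not part of Helix or zkclient, and the Callable stands in for whatever read the application performs (for example a call to ZkBaseDataAccessor.get()). It is an illustration under those assumptions, not a fix for the issue itself.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper: bounds how long the caller waits for a blocking read.
public class BoundedRead {
    // Daemon threads so a stuck read cannot keep the JVM alive on shutdown.
    private static final ExecutorService POOL = Executors.newCachedThreadPool(r -> {
        Thread t = new Thread(r, "bounded-read");
        t.setDaemon(true);
        return t;
    });

    // Runs the read on a worker thread and waits at most the given time for it.
    public static <T> T callWithTimeout(Callable<T> read, long timeout, TimeUnit unit)
            throws TimeoutException, ExecutionException, InterruptedException {
        Future<T> future = POOL.submit(read);
        try {
            return future.get(timeout, unit);
        } catch (TimeoutException e) {
            // Interrupt the worker; the read stops only if it responds to interruption.
            future.cancel(true);
            throw e;
        }
    }

    public static void main(String[] args) throws Exception {
        // A read that completes in time returns its value.
        String value = callWithTimeout(() -> "value", 1, TimeUnit.SECONDS);
        System.out.println(value);

        // A read that blocks too long surfaces as a TimeoutException in the caller.
        try {
            callWithTimeout(() -> {
                Thread.sleep(60_000); // stands in for a read stuck waiting on a connection
                return "never";
            }, 200, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            System.out.println("read timed out");
        }
    }
}
```

The stack above shows the thread parked in Condition.awaitUntil, which does respond to interruption. How the client library propagates that interrupt has not been verified here, so the timeout is only guaranteed to unblock the caller, not to release the worker thread.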
