zookeeper-user mailing list archives

From Krzysztof Rybak <krzysztof.ry...@motorolasolutions.com>
Subject Zookeeper exception: Timeout while waiting for epoch from quorum
Date Thu, 20 Oct 2016 14:50:25 GMT
Hi all,
this is my first mail to the group, so apologies in advance for any inconsistencies.
The ZooKeeper version is zookeeper-3.4.6.

I'm facing a problem when a ZooKeeper cluster is being reconfigured.

Initial state:
machine A:
zk1(follower)
zk2(leader)
machine B:
zk3(follower)

zk1 and zk2 are stopped (in that order).
zk1 is then started on machine B.
zk1 and zk3 do not form a cluster; the status (queried with the 'srvr' four-letter word) is
'This ZooKeeper instance is not currently serving requests'
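
The status check above can be scripted so that every server is polled the same way. The following is a minimal sketch, assuming the servers accept four-letter words on their client ports; the function names and the host/port pairs in the usage comment are hypothetical and not taken from the setup described here.

```python
import socket


def four_letter_word(host, port, cmd="srvr", timeout=5.0):
    """Send a four-letter command to a ZooKeeper client port and return the reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(cmd.encode("ascii"))
        chunks = []
        # The server closes the connection once it has written its reply.
        while True:
            data = sock.recv(4096)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def parse_mode(reply):
    """Return the value of the 'Mode:' line, or None if the server is not serving."""
    for line in reply.splitlines():
        if line.startswith("Mode:"):
            return line.split(":", 1)[1].strip()
    return None


def report(servers):
    """Print one status line per (host, port) pair."""
    for host, port in servers:
        try:
            mode = parse_mode(four_letter_word(host, port))
        except OSError as exc:
            print(f"{host}:{port} unreachable ({exc})")
            continue
        print(f"{host}:{port} {mode or 'not serving requests'}")


# Example call with hypothetical addresses:
# report([("machine-b", 2181), ("machine-b", 2183)])
```

A server that is part of a working quorum reports a Mode line (leader or follower); one that only returns the 'not currently serving requests' message yields None from parse_mode.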

Part of the log:
[2016-10-20 04:03:10,053] WARN Unexpected exception (org.apache.zookeeper.server.quorum.QuorumPeer)
java.lang.InterruptedException: Timeout while waiting for epoch from quorum
    at org.apache.zookeeper.server.quorum.Leader.getEpochToPropose(Leader.java:878)
    at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:377)
    at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:799)
[2016-10-20 04:03:10,054] INFO Shutting down (org.apache.zookeeper.server.quorum.Leader)
[2016-10-20 04:03:10,054] INFO Shutdown called (org.apache.zookeeper.server.quorum.Leader)
java.lang.Exception: shutdown Leader! reason: Forcing shutdown
    at org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:499)
    at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:805)
[2016-10-20 04:03:10,054] INFO shutting down (org.apache.zookeeper.server.ZooKeeperServer)
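
For context on the exception: in the 3.4 line, the newly elected leader waits in Leader.getEpochToPropose for up to initLimit * tickTime for a quorum of servers (the leader counts itself) to connect, and throws this InterruptedException if the quorum is not reached in time. The arithmetic can be sketched as follows. The tickTime and initLimit values are illustrative assumptions, not taken from the configuration of this cluster, and the helper names are hypothetical.

```python
def quorum_size(ensemble_size):
    """Smallest strict majority of the ensemble (default majority quorums)."""
    return ensemble_size // 2 + 1


def epoch_wait_ms(tick_time_ms, init_limit):
    """Upper bound on how long the leader waits for a quorum to connect."""
    return tick_time_ms * init_limit


# Three-server ensemble as in this thread; tickTime/initLimit are assumed values.
needed = quorum_size(3)
window = epoch_wait_ms(tick_time_ms=2000, init_limit=10)
print(f"leader needs {needed} of 3 servers within {window} ms")
```

With three servers the leader needs one follower besides itself, so the log above suggests the second running server did not connect to the elected leader within that window.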

Interestingly, when zk2 (the previous leader) is started on machine B
(instead of zk1), the cluster forms correctly.
The same thing happens when everything runs on a single machine.

The issue is similar to the one below, but the election algorithm I use is 3
(the default, and confirmed with electionAlg=3 in the .cfg files):
https://issues.apache.org/jira/browse/ZOOKEEPER-2400

thanks,
Krzysztof
