zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Krzysztof Rybak <krzysztof.ry...@motorolasolutions.com>
Subject Zookeeper exception: Timeout while waiting for epoch from quorum
Date Thu, 20 Oct 2016 14:50:25 GMT
Hi All,
first mail in the group so sorry for possible inconsistency in advance.
Zookeeper version is zookeeper-3.4.6.

I'm facing a problem when zookeeper is reconfiguring a cluster.

Initial state:
machine A:
machine B:

zk1 and zk2 are stopped (in that order).
zk1 is started on machine B.
zk1 and zk3 are not creating a cluster, status is (using srvr word)
'This ZooKeeper instance is not currently serving requests'

A part of a log is:
[2016-10-20 04:03:10,053] WARN Unexpected exception
java.lang.InterruptedException: Timeout while waiting for epoch from quorum
at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:377)
at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:799)
[2016-10-20 04:03:10,054] INFO Shutting down
[2016-10-20 04:03:10,054] INFO Shutdown called
java.lang.Exception: shutdown Leader! reason: Forcing shutdown
at org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:499)
at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:805)
[2016-10-20 04:03:10,054] INFO shutting down

What is interesting: when zk2(previous leader) is started on machine B
(instead of zk1) cluster is configured correctly.
The same situation happens when all happen on the single machine.

Issue is similar to this, but algorithm used by me is 3 (by default and
confirmed with electionAlg=3 in .cfg files)


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message