Hi there,
I have a cluster of 3 machines, running zookeeper 3.3.1.
zk1 fails to join the quorum while zk2 and zk3 interact correctly. zk1 is
stuck in the election loop. See the log below.
I checked the config files, the connectivity between the machines. I can't
find anything wrong.
Any ideas?
thanks in advance,
alexis
2011-01-07 00:14:23,156 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumPeer@551] - Initializing leader
election protocol...
2011-01-07 00:14:23,157 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@649] - New election.
My id = 1, Proposed zxid = 0
2011-01-07 00:14:23,158 - DEBUG [WorkerSender Thread:QuorumCnxManager@346] -
Opening channel to server 2
2011-01-07 00:14:23,159 - DEBUG [WorkerReceiver
Thread:FastLeaderElection$Messenger$WorkerReceiver@214] - Receive new
notification message. My id = 1
2011-01-07 00:14:23,160 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@689] - Notification:
1, 0, 1, 1, LOOKING, LOOKING, 1
2011-01-07 00:14:23,160 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@495] - id: 1, proposed
id: 1, zxid: 0, proposed zxid: 0
2011-01-07 00:14:23,161 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@717] - Adding vote:
>From = 1, Proposed leader = 1, Porposed zxid = 0, Proposed epoch = 1
2011-01-07 00:14:23,162 - INFO [WorkerSender Thread:QuorumCnxManager@162] -
Have smaller server identifier, so dropping the connection: (2, 1)
2011-01-07 00:14:23,162 - DEBUG [WorkerSender Thread:QuorumCnxManager@346] -
Opening channel to server 3
2011-01-07 00:14:23,172 - INFO [WorkerSender Thread:QuorumCnxManager@162] -
Have smaller server identifier, so dropping the connection: (3, 1)
2011-01-07 00:14:23,365 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
2011-01-07 00:14:23,366 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
2011-01-07 00:14:23,366 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel to
server 2
2011-01-07 00:14:23,367 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
server identifier, so dropping the connection: (2, 1)
2011-01-07 00:14:23,367 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel to
server 3
2011-01-07 00:14:23,378 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
server identifier, so dropping the connection: (3, 1)
2011-01-07 00:14:23,378 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@683] - Notification
time out: 400
2011-01-07 00:14:23,785 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
2011-01-07 00:14:23,785 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
2011-01-07 00:14:23,786 - DEBUG
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel to
server 2
2011-01-07 00:14:26,786 - INFO
[QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
server identifier, so dropping the connection: (2, 1)
...
|