zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pramod Srinivasan <pra...@juniper.net.INVALID>
Subject Zookeeper server not responding.
Date Fri, 24 Jan 2020 01:27:06 GMT
Hello Everyone,

I am using Zookeeper 3.5.1-alpha and I see a problem when I am using a 2 node setup.

Node 1 Zookeeper logs:

2020-01-11 11:29:52,141 [myid:2147483653] - INFO  [QuorumPeerListener:QuorumCnxManager$Listener@631]
- My election bind port: 0.0.0.0/0.0.0.0:61898
2020-01-11 11:29:52,149 [myid:2147483653] - ERROR [WorkerSender[myid=2147483653]:NIOServerCnxnFactory$1@92]
- Thread Thread[WorkerSender[myid=2147483653],5,main] died
java.lang.NullPointerException
        at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(Unknown
Source)
        at java.util.concurrent.LinkedBlockingQueue.poll(Unknown Source)
        at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.run(FastLeaderElection.java:462)
        at java.lang.Thread.run(Unknown Source)
2020-01-11 11:29:52,161 [myid:2147483653] - INFO  [QuorumPeer[myid=2147483653](plain=/0:0:0:0:0:0:0:0:61896)(secure=disabled):QuorumPeer@986]
- LOOKING

Node 2 Zookeeper logs:

2020-01-11 11:29:51,852 [myid:2147483652] - WARN  [WorkerSender[myid=2147483652]:QuorumCnxManager@459]
- Cannot open channel to 2147483653 at election address /128.0.0.5:61898
java.net.ConnectException: Connection refused
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
        at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
        at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
        at java.net.SocksSocketImpl.connect(Unknown Source)
        at java.net.Socket.connect(Unknown Source)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:444)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:485)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager.toSend(QuorumCnxManager.java:421)
        at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.process(FastLeaderElection.java:486)
        at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.run(FastLeaderElection.java:465)
        at java.lang.Thread.run(Unknown Source)

Zookeeper server on the nodes never recover from this state and clients are unable to connect
to the server. Any hint on what the problem is based on the back trace on Node 1 logs? Is
this a Zookeeper server code issue or a setup issue?

Thanks,
Pramod


Mime
View raw message