zookeeper-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steph van Schalkwyk <svanschalk...@gmail.com>
Subject Re: Follower never recovers and keeps saying ZooKeeperServer not running
Date Wed, 04 Jul 2018 05:55:04 GMT
I've seen this bur can't remember what caused it.
What's your zNode size?
steph

On Tue, Jul 3, 2018, 9:17 PM Benjamin Jaton <benjamin.jaton@gmail.com>
wrote:

> Hello,
>
> I'm wondering what can cause a ZK follower to check out like this:
>
> 2018-07-03T13:43:28,814 [myid:] - ERROR [LearnerHandler-/10.0.0.248:40282
> :LearnerHandler@620] - Unexpected exception causing shutdown while sock
> still open
> java.io.EOFException: null
>         at java.io.DataInputStream.readInt(DataInputStream.java:392)
> ~[?:1.8.0_171]
>         at
> org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
> ~[zookeeper-3.5.0-alpha.jar:3.5.2-alpha-SNAPSHOT-1702518]
>         at
>
> org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:83)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:99)
> ~[zookeeper-3.5.0-alpha.jar:3.5.2-alpha-SNAPSHOT-1702518]
>         at
>
> org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:514)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T13:43:28,814 [myid:] - WARN  [LearnerHandler-/10.0.0.248:40282
> :LearnerHandler@635] - ******* GOODBYE /10.0.0.248:40282 ********
> 2018-07-03T13:43:28,814 [myid:] - ERROR [LearnerHandler-/10.0.0.237:54408
> :LearnerHandler@620] - Unexpected exception causing shutdown while sock
> still open
> java.io.EOFException: null
>         at java.io.DataInputStream.readInt(DataInputStream.java:392)
> ~[?:1.8.0_171]
>         at
> org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
> ~[zookeeper-3.5.0-alpha.jar:3.5.2-alpha-SNAPSHOT-1702518]
>         at
>
> org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:83)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:99)
> ~[zookeeper-3.5.0-alpha.jar:3.5.2-alpha-SNAPSHOT-1702518]
>         at
>
> org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:514)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T13:43:28,814 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46754
> 2018-07-03T13:43:28,814 [myid:] - WARN  [LearnerHandler-/10.0.0.237:54408
> :LearnerHandler@635] - ******* GOODBYE /10.0.0.237:54408 ********
> 2018-07-03T13:43:28,814 [myid:] - INFO
> [ConnnectionExpirer:MBeanRegistry@128] - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Leader,name3=Connections,name4=10.0.0.111,name5=0x200005f37660000]
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33198
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46780
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33268
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [ConnnectionExpirer:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.111:39162 which had sessionid 0x200005f37660000
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46810
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46824
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46826
> 2018-07-03T13:43:28,815 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):Leader@613
> ]
> - Shutting down
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33316
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33548
> 2018-07-03T13:43:28,815 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):Leader@619
> ]
> - Shutdown called
> java.lang.Exception: shutdown Leader! reason: Not sufficient followers
> synced, only synced with sids: [ [2] ]
>         at
> org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:619)
> [zookeeper-3.5.0-alpha.jar:?]
>         at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:590)
> [zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:1077)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T13:43:28,815 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:33198 (no session
> established for client)
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Leader,name3=Connections,name4=10.0.0.237,name5=0x20000629130001b]
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection@688] - Notification: 2 (message
> format version), 3 (n.leader), 0x9000001cb (n.zxid), 0x0 (n.round), LOOKING
> (n.state), 3 (n.sid), 0x9 (n.peerEPoch), LEADING (my state)300000066
> (n.config version)
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.237:45602 which had
> sessionid
> 0x20000629130001b
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Leader,name3=Connections,name4=10.0.0.248,name5=0x2000005cf2a0008]
> 2018-07-03T13:43:28,816 [myid:] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection@688] - Notification: 2 (message
> format version), 3 (n.leader), 0x9000001cb (n.zxid), 0x7 (n.round), LOOKING
> (n.state), 3 (n.sid), 0x9 (n.peerEPoch), LEADING (my state)300000066
> (n.config version)
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:53764 which had
> sessionid
> 0x2000005cf2a0008
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:33268 (no session
> established for client)
> 2018-07-03T13:43:28,816 [myid:] - WARN
> [NIOWorkerThread-20:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Leader,name3=Connections,name4=10.0.0.248,name5=0x2000005cf2a000a]
> 2018-07-03T13:43:28,816 [myid:] - INFO
> [NIOWorkerThread-20:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.237:46826 (no session established for client)
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:53758 which had
> sessionid
> 0x2000005cf2a000a
> 2018-07-03T13:43:28,816 [myid:] - INFO
> [LearnerCnxAcceptor-0.0.0.0/0.0.0.0:2888:Leader$LearnerCnxAcceptor@373] -
> exception while shutting down acceptor: java.net.SocketException: Socket
> closed
> 2018-07-03T13:43:28,816 [myid:] - WARN
> [NIOWorkerThread-20:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,816 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Leader,name3=Connections,name4=10.0.0.248,name5=0x30000209a080002]
> 2018-07-03T13:43:28,816 [myid:] - INFO
> [NIOWorkerThread-20:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.237:46810 (no session established for client)
> 2018-07-03T13:43:28,817 [myid:] - WARN
> [NIOWorkerThread-6:NIOServerCnxn@369]
> - Exception causing close of session 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:53760 which had
> sessionid
> 0x30000209a080002
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [NIOWorkerThread-6:NIOServerCnxn@606]
> - Closed socket connection for client /10.0.0.237:46780 (no session
> established for client)
> 2018-07-03T13:43:28,817 [myid:] - WARN
> [NIOWorkerThread-4:NIOServerCnxn@369]
> - Exception causing close of session 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,817 [myid:] - INFO  [SessionTracker:ZooKeeperServer@384
> ]
> - Expiring session 0x300006dbd91000b, timeout of 15000ms exceeded
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):ZooKeeperServer@498
> ]
> - shutting down
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [NIOWorkerThread-4:NIOServerCnxn@606]
> - Closed socket connection for client /10.0.0.248:33548 (no session
> established for client)
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):SessionTrackerImpl@232
> ]
> - Shutting down
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [SessionTracker:QuorumZooKeeperServer@132] - Submitting global
> closeSession
> request for session 0x300006dbd91000b
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):LeaderRequestProcessor@77
> ]
> - Shutting down
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):PrepRequestProcessor@967
> ]
> - Shutting down
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [SessionTracker:SessionTrackerImpl@158] - SessionTrackerImpl exited loop!
> 2018-07-03T13:43:28,817 [myid:] - INFO  [ProcessThread(sid:2
> cport:-1)::PrepRequestProcessor@154] - PrepRequestProcessor exited loop!
> 2018-07-03T13:43:28,815 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46908
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33786
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:46954
> 2018-07-03T13:43:28,817 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33992
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47010
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34240
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47084
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47142
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47248
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34532
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47314
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47342
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47384
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34706
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47452
> 2018-07-03T13:43:28,818 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47496
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47512
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34878
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47580
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47620
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35086
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47700
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47748
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47772
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47886
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47894
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47920
> 2018-07-03T13:43:28,819 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:47998
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48018
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48020
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48136
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48138
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48140
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48172
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35898
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48294
> 2018-07-03T13:43:28,820 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48296
> 2018-07-03T13:43:28,817 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):ProposalRequestProcessor@88
> ]
> - Shutting down
> 2018-07-03T13:43:28,816 [myid:] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection@688] - Notification: 2 (message
> format version), 1 (n.leader), 0x9000001cb (n.zxid), 0x7 (n.round), LOOKING
> (n.state), 1 (n.sid), 0x9 (n.peerEPoch), LEADING (my state)300000066
> (n.config version)
> 2018-07-03T13:43:28,817 [myid:] - WARN
> [NIOWorkerThread-20:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,821 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48298
> 2018-07-03T13:43:28,821 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):CommitProcessor@358
> ]
> - Shutting down
> 2018-07-03T13:43:28,821 [myid:] - INFO
> [NIOWorkerThread-20:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.237:46754 (no session established for client)
> 2018-07-03T13:43:28,821 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:48334
> 2018-07-03T13:43:28,821 [myid:] - INFO
> [CommitProcessor:2:CommitProcessor@193] - CommitProcessor exited loop!
> 2018-07-03T13:43:28,821 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:36162
> 2018-07-03T13:43:28,821 [myid:] - WARN
> [NIOWorkerThread-20:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,821 [myid:] - WARN
> [NIOWorkerThread-16:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,821 [myid:] - WARN
> [NIOWorkerThread-31:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T13:43:28,821 [myid:] - WARN
> [NIOWorkerThread-11:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
>
> The rest are more failed clients connections attempts and "ZooKeeperServer
> not running" messages for hours.
>
>
> There was an event before that that could be linked:
>
> 2018-07-03T12:42:27,639 [myid:] - INFO
> [NIOWorkerThread-7:NIOServerCnxn@606]
> - Closed socket connection for client /10.0.0.237:35536 (no session
> established for client)
> 2018-07-03T12:47:28,864 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33256
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35568
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33544
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35672
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:33788
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35756
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34014
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35840
> 2018-07-03T12:47:28,865 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34244
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35924
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34466
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:35994
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34730
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36086
> 2018-07-03T12:47:28,866 [myid:] - WARN
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):Follower@93
> ]
> - Exception when following the leader
> java.net.SocketException: Broken pipe (Write failed)
>         at java.net.SocketOutputStream.socketWrite0(Native Method)
> ~[?:1.8.0_171]
>         at
> java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:111)
> ~[?:1.8.0_171]
>         at java.net.SocketOutputStream.write(SocketOutputStream.java:155)
> ~[?:1.8.0_171]
>         at
> java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)
> ~[?:1.8.0_171]
>         at
> java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140)
> ~[?:1.8.0_171]
>         at
> org.apache.zookeeper.server.quorum.Learner.writePacket(Learner.java:141)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.Learner.ping(Learner.java:609)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
>
> org.apache.zookeeper.server.quorum.Follower.processPacket(Follower.java:116)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.Follower.followLeader(Follower.java:90)
> [zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:1064)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34944
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36166
> 2018-07-03T12:47:28,866 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Follower]
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:34952
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36172
> 2018-07-03T12:47:28,866 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):Follower@198
> ]
> - shutdown called
> java.lang.Exception: shutdown Follower
>         at
> org.apache.zookeeper.server.quorum.Follower.shutdown(Follower.java:198)
> [zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:1068)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35178
> 2018-07-03T12:47:28,866 [myid:] - INFO
> [NIOWorkerThread-14:ZooKeeperServer@969] - Client attempting to establish
> new session at /10.0.0.248:33256
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36262
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35244
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36276
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35396
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36334
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35472
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36368
> 2018-07-03T12:47:28,867 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35708
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36446
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:35930
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36530
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:36150
> 2018-07-03T12:47:28,867 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:33544 (no session
> established for client)
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36608
> 2018-07-03T12:47:28,868 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):NIOServerCnxn@606
> ]
> - Closed socket connection for client /10.0.0.248:33256 which had
> sessionid
> 0x200003aa13c0010
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36692
> 2018-07-03T12:47:28,868 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):LearnerZooKeeperServer@167
> ]
> - Shutting down
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.248:36418
> 2018-07-03T12:47:28,868 [myid:] - ERROR
> [FollowerRequestProcessor:2:ZooKeeperCriticalThread@48] - Severe
> unrecoverable error, from thread : FollowerRequestProcessor:2
> java.net.SocketException: Socket closed
>         at
> java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:118)
> ~[?:1.8.0_171]
>         at java.net.SocketOutputStream.write(SocketOutputStream.java:155)
> ~[?:1.8.0_171]
>         at
> java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82)
> ~[?:1.8.0_171]
>         at
> java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140)
> ~[?:1.8.0_171]
>         at
> org.apache.zookeeper.server.quorum.Learner.writePacket(Learner.java:141)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
> org.apache.zookeeper.server.quorum.Learner.request(Learner.java:190)
> ~[zookeeper-3.5.0-alpha.jar:?]
>         at
>
> org.apache.zookeeper.server.quorum.FollowerRequestProcessor.run(FollowerRequestProcessor.java:101)
> [zookeeper-3.5.0-alpha.jar:?]
> 2018-07-03T12:47:28,868 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):ZooKeeperServer@498
> ]
> - shutting down
> 2018-07-03T12:47:28,868 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):FollowerRequestProcessor@138
> ]
> - Shutting down
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [FollowerRequestProcessor:2:ZooKeeperServer$ZooKeeperServerListenerImpl@475
> ]
> - Thread FollowerRequestProcessor:2 exits, error code 1
> 2018-07-03T12:47:28,868 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):CommitProcessor@358
> ]
> - Shutting down
> 2018-07-03T12:47:28,869 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):FinalRequestProcessor@492
> ]
> - shutdown of request processor complete
> 2018-07-03T12:47:28,869 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):MBeanRegistry@128
> ]
> - Unregister MBean
>
> [org.apache.ZooKeeperService:name0=ReplicatedServer_id2,name1=replica.2,name2=Follower,name3=InMemoryDataTree]
> 2018-07-03T12:47:28,870 [myid:] - INFO
>
> [QuorumPeer[myid=2](plain=/0:0:0:0:0:0:0:0:2181)(secure=disabled):SyncRequestProcessor@191
> ]
> - Shutting down
> 2018-07-03T12:47:28,870 [myid:] - WARN
> [NIOWorkerThread-17:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,870 [myid:] - INFO
> [NIOWorkerThread-17:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.248:34952 (no session established for client)
> 2018-07-03T12:47:28,870 [myid:] - WARN
> [NIOWorkerThread-17:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,870 [myid:] - INFO
> [NIOWorkerThread-17:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.248:34014 (no session established for client)
> 2018-07-03T12:47:28,870 [myid:] - WARN
> [NIOWorkerThread-32:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,868 [myid:] - INFO
> [NIOServerCxnFactory.AcceptThread:/0.0.0.0:2181
> :NIOServerCnxnFactory$AcceptThread@296] - Accepted socket connection from
> /
> 10.0.0.237:36766
> 2018-07-03T12:47:28,871 [myid:] - INFO
> [NIOWorkerThread-32:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.248:36150 (no session established for client)
> 2018-07-03T12:47:28,871 [myid:] - WARN
> [NIOWorkerThread-32:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,871 [myid:] - WARN
> [NIOWorkerThread-17:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,871 [myid:] - INFO
> [NIOWorkerThread-32:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.248:33788 (no session established for client)
> 2018-07-03T12:47:28,871 [myid:] - INFO
> [NIOWorkerThread-17:NIOServerCnxn@606] - Closed socket connection for
> client /10.0.0.248:35930 (no session established for client)
> 2018-07-03T12:47:28,871 [myid:] - WARN
> [NIOWorkerThread-32:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,871 [myid:] - WARN
> [NIOWorkerThread-17:NIOServerCnxn@369] - Exception causing close of
> session
> 0x0: ZooKeeperServer not running
> 2018-07-03T12:47:28,871 [myid:] - INFO
> [CommitProcessor:2:CommitProcessor@193] - CommitProcessor exited loop!
>
> In particular I don't like the sound of this "Severe unrecoverable error,
> from thread : FollowerRequestProcessor" from "ZooKeeperCriticalThread".
>
> Is this a fatal event for the whole ZK server?
>
> Thanks
> Benjamin
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message