zookeeper-user mailing list archives

From Abraham Fine <af...@apache.org>
Subject Re: java.io.EOFException
Date Thu, 29 Jun 2017 19:49:45 GMT

Hi Upendar-

The exception that is most concerning is: 
2017-06-27 21:24:58,840 [myid:3] - WARN  [zookeeper3-prod.aws.npp.neustar.biz/XXXX:3888:QuorumCnxManager@260] - Exception reading or writing challenge: java.io.EOFException
2017-06-27 21:24:58,840 [myid:3] - ERROR [LearnerHandler-/XXXX:39920:LearnerHandler@633] - Unexpected exception causing shutdown while sock still open
java.io.EOFException
        at java.io.DataInputStream.readInt(DataInputStream.java:392)
        at org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
        at org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:83)
        at org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:103)
        at org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:309)

In my experience this is related to timeouts that are too small on your
learners (the timeouts here are derived from tickTime, syncLimit, and
initLimit). Are you able to send the corresponding exception that
occurred on the learner? That may make what's going on here a little
clearer.
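
To make the arithmetic concrete, here is a minimal sketch of how those
timeouts fall out of the config. The values (tickTime=2000, initLimit=10,
syncLimit=5) are illustrative only and are not taken from your zoo.cfg, and
the class and method names are hypothetical. It assumes the usual reading of
the settings: a learner gets tickTime * initLimit ms to connect and sync
initially, and tickTime * syncLimit ms to stay in sync afterwards.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical helper, not part of ZooKeeper: derives quorum timeouts from zoo.cfg values.
public class QuorumTimeouts {

    // Time (ms) a learner is allowed to connect and sync to the leader.
    public static int initTimeoutMs(int tickTime, int initLimit) {
        return tickTime * initLimit;
    }

    // Time (ms) a learner may go without syncing before it is dropped.
    public static int syncTimeoutMs(int tickTime, int syncLimit) {
        return tickTime * syncLimit;
    }

    public static void main(String[] args) throws IOException {
        // Illustrative config values (assumption); substitute the ones from your zoo.cfg.
        String cfg = "tickTime=2000\ninitLimit=10\nsyncLimit=5\n";

        Properties p = new Properties();
        p.load(new StringReader(cfg));

        int tickTime = Integer.parseInt(p.getProperty("tickTime"));
        int initLimit = Integer.parseInt(p.getProperty("initLimit"));
        int syncLimit = Integer.parseInt(p.getProperty("syncLimit"));

        System.out.println("init timeout ms: " + initTimeoutMs(tickTime, initLimit));
        System.out.println("sync timeout ms: " + syncTimeoutMs(tickTime, syncLimit));
    }
}
```

If the numbers you get from your own config are small relative to how long a
snapshot transfer or a GC pause can take on those instances, raising
initLimit or syncLimit may be worth trying.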

Thanks,
Abe
On Thu, Jun 29, 2017, at 09:59, Michael Han wrote:
> On Wed, Jun 28, 2017 at 11:59 PM, Mike Richardson <mike@motum.be> wrote:
> 
> > Unsubscribe
> >
> >
> >
> Unsubscribe does not work like this. To unsubscribe, please click the
> Unsubscribe from List <user-unsubscribe@zookeeper.apache.org> link
> from https://zookeeper.apache.org/lists.html.
> 
> 
> > Mike Richardson
> >
> > Senior Software Engineer
> >
> >
> >
> > *MoTuM N.V. | Dellingstraat 34 | B-2800 MECHELEN | Belgium*
> >
> >
> > T +32(0)15 28 16 63
> > M +32 494 63 98 15
> >
> >
> > www.motum.be
> >
> > On 28 June 2017 at 18:12, upendar devu <devulapalli8@gmail.com> wrote:
> >
> >> We have 3 instances of ZooKeeper and have seen the error below; the
> >> ZooKeeper server process then stopped automatically for a while and
> >> was later restarted automatically.
> >>
> >> We see this error at least once a month. We have the data directory
> >> within the ZooKeeper AWS instance. Could you please help us?
> >>
> >> Thanks
> >>
> >> Zookeeper log :
> >>
> >>
> >> 2017-06-27 21:24:58,839 [myid:3] - WARN  [NIOServerCxn.Factory:/IP:2181:NIOServerCnxn@357] - caught end of stream exception
> >> EndOfStreamException: Unable to read additional data from client sessionid 0x0, likely client has closed socket
> >>         at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:228)
> >>         at org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208)
> >>         at java.lang.Thread.run(Thread.java:745)
> >> 2017-06-27 21:24:58,839 [myid:3] - INFO  [NIOServerCxn.Factory:/1XX.XX.XX.XX:2181:NIOServerCnxn@1007] - Closed socket connection for client /XX.XX.XX.XX:57497 (no session established for client)
> >> 2017-06-27 21:24:58,840 [myid:3] - INFO  [zookeeper3-prod.aws.npp.neustar.biz/XX.XX.XX.XX:3888:QuorumCnxManager$Listener@511]
> >> - Received connection request /XX.XX>XX
> >> 2:35419
> >> 2017-06-27 21:24:58,840 [myid:3] - WARN  [zookeeper3-prod.aws.npp.neustar.biz/XXXX:3888:QuorumCnxManager@260] - Exception reading or writing challenge: java.io.EOFException
> >> 2017-06-27 21:24:58,840 [myid:3] - ERROR [LearnerHandler-/XXXX:39920:LearnerHandler@633] - Unexpected exception causing shutdown while sock still open
> >> java.io.EOFException
> >>         at java.io.DataInputStream.readInt(DataInputStream.java:392)
> >>         at org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
> >>         at org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:83)
> >>         at org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:103)
> >>         at org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:309)
> >> 2017-06-27 21:24:58,841 [myid:3] - WARN
> >>  [LearnerHandler-/IP:39920:LearnerHandler@646] - ******* GOODBYE
> >> /XXXX:39920 ********
> >> 2017-06-27 21:25:03,187 [myid:3] - INFO
> >>  [NIOServerCxn.Factory:/IP:2181:NIOServerCnxnFactory@197] - Accepted
> >> socket
> >> connection from /1XXX:36060
> >> 2017-06-27 21:25:03,187 [myid:3] - INFO
> >>  [NIOServerCxn.Factory:/IP:2181:NIOServerCnxn@827] - Processing srvr
> >> command from /XXX:36060
> >> 2017-06-27 21:25:03,188 [myid:3] - INFO  [Thread-661253:NIOServerCnxn@1007]
> >> - Closed socket connection for client /XXX:36060 (no session establis
> >> 2017-06-27 21:27:17,496 [myid:3] - INFO
> >>  [CommitProcessor:3:ZooKeeperServer@617] - Established session
> >> 0x35b125b84aa028a with negotiated timeout 30000 for client /IP:40526
> >> 2017-06-27 21:27:38,763 [myid:] - INFO  [main:QuorumPeerConfig@103] -
> >> Reading configuration from: /opt/zookeeper/current/bin/../conf/zoo.cfg
> >> 2017-06-27 21:27:38,780 [myid:] - INFO  [main:QuorumPeerConfig@340] -
> >> Defaulting to majority quorums
> >> 2017-06-27 21:27:38,784 [myid:3] - INFO  [main:DatadirCleanupManager@78]
> >> -
> >> autopurge.snapRetainCount set to 3
> >> 2017-06-27 21:27:38,784 [myid:3] - INFO  [main:DatadirCleanupManager@79]
> >> -
> >> autopurge.purgeInterval set to 1
> >> 2017-06-27 21:27:38,799 [myid:3] - INFO
> >>  [PurgeTask:DatadirCleanupManager$PurgeTask@138] - Purge task started.
> >> 2017-06-27 21:27:38,821 [myid:3] - INFO  [main:QuorumPeerMain@127] -
> >> Starting quorum peer
> >> 2017-06-27 21:27:38,869 [myid:3] - INFO  [main:NIOServerCnxnFactory@94] -
> >> binding to port /IP:2181
> >> 2017-06-27 21:27:38,
> >>
> >
> >
> 
> 
> -- 
> Cheers
> Michael.
