zookeeper-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Arshad Mohammad (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (ZOOKEEPER-2570) ZooKeeper clients are timed out when ZooKeeper servers are very busy
Date Thu, 13 Oct 2016 19:00:25 GMT

     [ https://issues.apache.org/jira/browse/ZOOKEEPER-2570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Arshad Mohammad updated ZOOKEEPER-2570:
---------------------------------------
    Attachment: ZOOKEEPER-2570-01.patch

Submitting a test patch

> ZooKeeper clients are timed out when ZooKeeper servers are very busy
> --------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2570
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2570
>             Project: ZooKeeper
>          Issue Type: Bug
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>         Attachments: ZOOKEEPER-2570-01.patch
>
>
> ZooKeeper clients are timed out when ZooKeeper servers are very busy. Clients throw below
exception and fail all the pending operations
> {code}
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
> {code}
> Clients log bellow information
> {noformat}
> 2016-09-22 01:49:08,001 [myid:127.0.0.1:11228] - WARN  [main-SendThread(127.0.0.1:11228):ClientCnxn$SendThread@1181]
- Client session timed out, have not heard from server in 13908ms for sessionid 0x20000d21b280000
> 2016-09-22 01:49:08,001 [myid:127.0.0.1:11228] - INFO  [main-SendThread(127.0.0.1:11228):ClientCnxn$SendThread@1229]
- Client session timed out, have not heard from server in 13908ms for sessionid 0x20000d21b280000,
closing socket connection and attempting reconnect
> {noformat}
> *STEPS TO REPRODECE:*
> # Create multi operation
> {code}
> List<Op> ops = new ArrayList<Op>();
>         for (int i = 0; i < N; i++) {
>             Op create = Op.create(rootNode + "/" + i, "".getBytes(), ZooDefs.Ids.OPEN_ACL_UNSAFE,
>                     CreateMode.PERSISTENT);
>             ops.add(create);
>         }
> {code}
> Chose N in such a way that the total multi operation request  size is less than but near
1 MB.  For bigger request size increase jute.maxbuffer in servers
> # Submit the multi operation request
> {code} zooKeeper.multi(ops);{code} 
> # After repeating above steps few times issue is reproduced



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message