curator-dev mailing list archives

From "Fangjin Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CURATOR-153) PathChildrenCache occasionally cannot reconnect to ZK
Date Mon, 13 Oct 2014 17:39:33 GMT

    [ https://issues.apache.org/jira/browse/CURATOR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169607#comment-14169607 ]

Fangjin Yang commented on CURATOR-153:
--------------------------------------

The problem seems to occur primarily when ZooKeeper's transaction ID overflows and the
service needs to be restarted.

> PathChildrenCache occasionally cannot reconnect to ZK
> -----------------------------------------------------
>
>                 Key: CURATOR-153
>                 URL: https://issues.apache.org/jira/browse/CURATOR-153
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Recipes
>    Affects Versions: 2.4.2, 2.5.0, 2.6.0
>            Reporter: Fangjin Yang
>
> We use Curator as part of the Druid open source project (druid.io). We've had issues
> where if ZK is brought down and back up, numerous nodes cannot reconnect. The issue is very
> difficult to reproduce locally but we've seen it often in production. The issue appears to
> be in PathChildrenCache. There is a longer description here:
> https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/druid-development/54avmEvLN3E/orZ1taF8hFsJ



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
