Date: Thu, 9 Oct 2014 18:50:34 +0000 (UTC)
From: "Scott Blum (JIRA)"
To: dev@curator.apache.org
Reply-To: dev@curator.apache.org
Subject: [jira] [Commented] (CURATOR-153) PathChildrenCache occasionally cannot reconnect to ZK


    [ https://issues.apache.org/jira/browse/CURATOR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165533#comment-14165533 ]

Scott Blum commented on CURATOR-153:
------------------------------------

Maybe the BackgroundCallback.processResult() in PathChildrenCache.refresh() needs to check that event.getType() == CHILDREN?

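
The check suggested above could look roughly like the sketch below. It is not Curator's code and not a confirmed fix for this issue. To keep it compilable without Curator on the classpath, EventType and Event are hypothetical stand-ins for CuratorEventType and CuratorEvent, and ChildrenEventGuard, lastChildren and getLastChildren() are made-up names. The only behaviour shown is: ignore any background result whose type is not CHILDREN, then apply the child list only when the result code is OK.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Sketch only: EventType and Event are hypothetical stand-ins for Curator's
// CuratorEventType and CuratorEvent, so the example compiles without Curator.
class ChildrenEventGuard {
    // Only a few of the real event types are listed here.
    enum EventType { CHILDREN, WATCHED, CLOSING }

    static final class Event {
        private final EventType type;
        private final int resultCode;
        private final List<String> children;

        Event(EventType type, int resultCode, List<String> children) {
            this.type = type;
            this.resultCode = resultCode;
            this.children = children;
        }

        EventType getType() { return type; }
        int getResultCode() { return resultCode; }
        List<String> getChildren() { return children; }
    }

    // ZooKeeper's KeeperException.Code.OK has the integer value 0.
    static final int OK = 0;

    // Child names from the most recent accepted CHILDREN result.
    private List<String> lastChildren = new ArrayList<>();

    // Returns true only when the event was a successful CHILDREN result
    // and its child list was applied.
    boolean processResult(Event event) {
        if (event.getType() != EventType.CHILDREN) {
            return false; // the proposed guard: skip unrelated event types
        }
        if (event.getResultCode() != OK || event.getChildren() == null) {
            return false; // failed call or no payload: nothing to apply
        }
        lastChildren = new ArrayList<>(event.getChildren());
        return true;
    }

    List<String> getLastChildren() { return lastChildren; }

    public static void main(String[] args) {
        ChildrenEventGuard guard = new ChildrenEventGuard();
        System.out.println(guard.processResult(new Event(EventType.WATCHED, OK, null)));
        System.out.println(guard.processResult(
            new Event(EventType.CHILDREN, OK, Arrays.asList("a", "b"))));
        System.out.println(guard.getLastChildren());
    }
}
```

In the real recipe the equivalent comparison would presumably be event.getType() == CuratorEventType.CHILDREN inside the callback's processResult(CuratorFramework, CuratorEvent). Whether that addresses the reconnect problem reported here is still an open question.
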
> PathChildrenCache occasionally cannot reconnect to ZK
> -----------------------------------------------------
>
>                 Key: CURATOR-153
>                 URL: https://issues.apache.org/jira/browse/CURATOR-153
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Recipes
>    Affects Versions: 2.4.2, 2.5.0, 2.6.0
>            Reporter: Fangjin Yang
>
> We use Curator as part of the Druid open source project (druid.io). We've had issues where if ZK is brought down and back up, numerous nodes cannot reconnect. The issue is very difficult to reproduce locally but we've seen it often in production. The issue appears to be in PathChildrenCache. There is a longer description here:
> https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/druid-development/54avmEvLN3E/orZ1taF8hFsJ



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)