Return-Path: X-Original-To: apmail-curator-dev-archive@minotaur.apache.org Delivered-To: apmail-curator-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2064417B1C for ; Tue, 21 Apr 2015 01:28:59 +0000 (UTC) Received: (qmail 59312 invoked by uid 500); 21 Apr 2015 01:28:59 -0000 Delivered-To: apmail-curator-dev-archive@curator.apache.org Received: (qmail 59262 invoked by uid 500); 21 Apr 2015 01:28:59 -0000 Mailing-List: contact dev-help@curator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@curator.apache.org Delivered-To: mailing list dev@curator.apache.org Received: (qmail 59249 invoked by uid 99); 21 Apr 2015 01:28:59 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Apr 2015 01:28:59 +0000 Date: Tue, 21 Apr 2015 01:28:58 +0000 (UTC) From: "Jordan Zimmerman (JIRA)" To: dev@curator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (CURATOR-153) PathChildrenCache occasionally cannot reconnect to ZK MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CURATOR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jordan Zimmerman updated CURATOR-153: ------------------------------------- Fix Version/s: (was: 2.7.2) awaiting-response > PathChildrenCache occasionally cannot reconnect to ZK > ----------------------------------------------------- > > Key: CURATOR-153 > URL: https://issues.apache.org/jira/browse/CURATOR-153 > Project: Apache Curator > Issue Type: Bug > Components: Framework, Recipes > Affects Versions: 2.4.2, 2.5.0, 2.6.0 > Reporter: Fangjin Yang > Assignee: Jordan Zimmerman > Priority: Critical > Fix For: awaiting-response > > > We use Curator as part of the Druid open source project (druid.io). We've had issues where if ZK is brought down and back up, numerous nodes cannot reconnect. The issue is very difficult to reproduce locally but we've seen it often in production. The issue appears to be in PathChildrenCache. There is a longer description here: > https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/druid-development/54avmEvLN3E/orZ1taF8hFsJ -- This message was sent by Atlassian JIRA (v6.3.4#6332)