Return-Path: X-Original-To: apmail-curator-dev-archive@minotaur.apache.org Delivered-To: apmail-curator-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 23CC3105A5 for ; Wed, 2 Apr 2014 19:02:24 +0000 (UTC) Received: (qmail 59579 invoked by uid 500); 2 Apr 2014 19:02:17 -0000 Delivered-To: apmail-curator-dev-archive@curator.apache.org Received: (qmail 59312 invoked by uid 500); 2 Apr 2014 19:02:16 -0000 Mailing-List: contact dev-help@curator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@curator.apache.org Delivered-To: mailing list dev@curator.apache.org Received: (qmail 59235 invoked by uid 99); 2 Apr 2014 19:02:14 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Apr 2014 19:02:14 +0000 Date: Wed, 2 Apr 2014 19:02:14 +0000 (UTC) From: "Jay Bae (JIRA)" To: dev@curator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CURATOR-76) Adding leader selection ChildReaper recipe MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CURATOR-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13958040#comment-13958040 ] Jay Bae commented on CURATOR-76: -------------------------------- Thanks a lot Jordan. > Adding leader selection ChildReaper recipe > ------------------------------------------ > > Key: CURATOR-76 > URL: https://issues.apache.org/jira/browse/CURATOR-76 > Project: Apache Curator > Issue Type: Improvement > Components: Recipes > Reporter: Jay Bae > Assignee: Jordan Zimmerman > Fix For: 2.4.2 > > Attachments: CURATOR-76.patch > > > We are having serious data corruption issue when we are rolling restart of zookeeper servers due to one application which is using ChildReaper recipe. I am not sure its root cause but my theory is, when the multiple instances are running ChildReaper recipe, they would conflict each other among checking exist and deleting paths. This conflict can cause data corruption. We observed all servers died due to corrupted data and we had to manually copy log/snapshot data and restart them. -- This message was sent by Atlassian JIRA (v6.2#6252)