From: revans2
To: dev@zookeeper.apache.org
Subject: [GitHub] zookeeper issue #157: ZOOKEEPER-2678: Discovery and Sync can take a very lon...
Date: Mon, 13 Feb 2017 14:55:50 +0000 (UTC)

Github user revans2 commented on the issue:

https://github.com/apache/zookeeper/pull/157

@rakeshadr If it makes you feel any better we have been running with an older version of this patch in production for a while.
We have used it during rolling upgrades at least 10 times in production; without it, we would have had some very painful outages. I have also manually tested it at least 50 times by shooting the leader under load (10,000 operations/second) on a 3.4 GB DB, watching it recover, and then validating the integrity of the DB to be sure we didn't get any corruption.