Date: Tue, 3 Sep 2013 12:58:59 +0000 (UTC)
From: "ASF subversion and git services (JIRA)"
To: notifications@accumulo.apache.org
Reply-To: jira@apache.org
Subject: [jira] [Commented] (ACCUMULO-1572) single node zookeeper failure kills connected accumulo servers

    [ https://issues.apache.org/jira/browse/ACCUMULO-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13756587#comment-13756587 ]

ASF subversion and git services commented on ACCUMULO-1572:
-----------------------------------------------------------

Commit 4ed51ecbca7d4120c5c31531ecbebb5d56a7b79f in branch refs/heads/1.5.1-SNAPSHOT from [~ecn]
[ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=4ed51ec ]

ACCUMULO-1572 apply missing patch; prevent logger from killing itself on a Disconnect event

> single node zookeeper failure kills connected accumulo servers
> --------------------------------------------------------------
>
>                 Key: ACCUMULO-1572
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1572
>             Project: Accumulo
>          Issue Type: Bug
>          Components: master, tserver
>   Affects Versions: 1.5.0
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>            Priority: Blocker
>             Fix For: 1.4.5, 1.5.1, 1.6.0
>
>
> Drew Thornton writes on the user mailing list:
> {quote}
> If one zookeeper node is shutdown/fails/whatever and the rest of the ensemble stays up, the tablet servers attached as clients to the shutdown node immediately fail. If one of the clients happens to be the master, the cluster goes down.
> Accumulo does not seem to be failing over to the remaining zookeeper nodes, and this causes me to restart the individual tablet servers again.
> The zookeeper ensemble is very stable and has plenty of bandwidth/memory/processing, so taking one node down out of five doesn't crash the zookeepers, just the tablet servers...
> {quote}