From notifications-return-4200-apmail-accumulo-notifications-archive=accumulo.apache.org@accumulo.apache.org Thu Mar 21 16:01:16 2013 Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5F8A3F677 for ; Thu, 21 Mar 2013 16:01:16 +0000 (UTC) Received: (qmail 5650 invoked by uid 500); 21 Mar 2013 16:01:16 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 5610 invoked by uid 500); 21 Mar 2013 16:01:16 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 5598 invoked by uid 99); 21 Mar 2013 16:01:16 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 Mar 2013 16:01:16 +0000 Date: Thu, 21 Mar 2013 16:01:16 +0000 (UTC) From: "Keith Turner (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (ACCUMULO-513) Need to deal with half-dead tservers MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ACCUMULO-513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Keith Turner updated ACCUMULO-513: ---------------------------------- Affects Version/s: 1.4.0 > Need to deal with half-dead tservers > ------------------------------------ > > Key: ACCUMULO-513 > URL: https://issues.apache.org/jira/browse/ACCUMULO-513 > Project: Accumulo > Issue Type: Bug > Affects Versions: 1.4.0 > Reporter: Keith Turner > Assignee: Keith Turner > Fix For: 1.5.0 > > > Every once in a while a tablet server will go zombie. The tablet server is alive and talking to the master and zookeeper, but not accepting new connections from clients. To keep these types of tablet servers from causing problems a process could try to scan a tablet on each tablet server periodically. If it can not scan a tablet within a certain number of tries, it could delete the tservers lock in zookeeper. The monitor process could do this. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira