Return-Path: Delivered-To: apmail-hadoop-zookeeper-user-archive@minotaur.apache.org Received: (qmail 1881 invoked from network); 29 Apr 2010 07:09:09 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Apr 2010 07:09:09 -0000 Received: (qmail 9072 invoked by uid 500); 29 Apr 2010 07:09:09 -0000 Delivered-To: apmail-hadoop-zookeeper-user-archive@hadoop.apache.org Received: (qmail 8961 invoked by uid 500); 29 Apr 2010 07:09:09 -0000 Mailing-List: contact zookeeper-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: zookeeper-user@hadoop.apache.org Delivered-To: mailing list zookeeper-user@hadoop.apache.org Received: (qmail 8948 invoked by uid 99); 29 Apr 2010 07:09:08 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 07:09:08 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of traviscrawford@gmail.com designates 74.125.83.48 as permitted sender) Received: from [74.125.83.48] (HELO mail-gw0-f48.google.com) (74.125.83.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 07:09:00 +0000 Received: by gwj23 with SMTP id 23so1121483gwj.35 for ; Thu, 29 Apr 2010 00:08:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:from:mime-version:date:message-id :subject:to:content-type; bh=S58ypu0ApvI2MzEKPC7JomrYCzcmI4DI2I3lZVgpvdM=; b=kurVyMPJETUuZSCJOOBLFu3KXr/wpgwOspEjLtMLS2noSC+h09KYOvAFs89DPuNStD NyRzgx+YwhdhX7TSf9KVSzs8jMRDTTaB/U8PCJZhQ+hSZFC/Cpqm28r0bn7gq+u70WKj +6SsLiPFRPd+4C7ms5JndqIowyWXb/jAGiX2A= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:mime-version:date:message-id:subject:to:content-type; b=ePKBw2E/y2uhVvxBW85+luEadsuOKdqm2gZiWrZSkf/0KHPe6VsPdUZWh/JegfmUAS NMr1FuSNZoP/OilfK4Y2DdfX8+8Qz0AwSnkdGoVGn46gOCs5f/wZpKs/FGQFakFlgMfQ vucZjeXua5lAR34zMwo31LRUh2HL7p6whghvU= Received: by 10.91.1.2 with SMTP id d2mr160705agi.121.1272524919570; Thu, 29 Apr 2010 00:08:39 -0700 (PDT) From: Travis Crawford Mime-Version: 1.0 (iPad Mail 7B367) Date: Thu, 29 Apr 2010 00:08:39 -0700 Message-ID: <-2000849610076933609@unknownmsgid> Subject: Misbehaving zk servers To: zookeeper-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Hey zookeeper gurus - We recently had a zookeeper outage when one ZK server was started with a low limit after upgrading to 3.3.0. Several days later the outage occurred when that node reached its file descriptor limit and clients started having major issues. Are there any circumstances when a ZK server will get blacklisted from the ensemble? Something similar to how tasktrackers are blacklisted when too many tasks fail. Thanks! Travis