Return-Path: X-Original-To: apmail-zookeeper-user-archive@www.apache.org Delivered-To: apmail-zookeeper-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F40C7DAF3 for ; Thu, 23 May 2013 13:21:36 +0000 (UTC) Received: (qmail 64628 invoked by uid 500); 23 May 2013 13:21:36 -0000 Delivered-To: apmail-zookeeper-user-archive@zookeeper.apache.org Received: (qmail 64312 invoked by uid 500); 23 May 2013 13:21:34 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 63154 invoked by uid 99); 23 May 2013 13:21:33 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 May 2013 13:21:33 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of briantarbox@gmail.com designates 209.85.217.181 as permitted sender) Received: from [209.85.217.181] (HELO mail-lb0-f181.google.com) (209.85.217.181) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 May 2013 13:21:27 +0000 Received: by mail-lb0-f181.google.com with SMTP id w20so3351592lbh.40 for ; Thu, 23 May 2013 06:21:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=d9SEtHTryf5VEnYHEqDwMjJcLIRpbkaK6n8jNd7+emE=; b=YWZFbToqn141DVoTLM3aldvJ8/EABOpxOtx/FXuwa9T704CbjyqCEBm2VcK0MRd4zB 51UXeprMgrGbHl8GceYynGrDKOsWJj3e0G1Cu0tt0QSuDSSWyf/anA0P5V74tNGSfQnM 0j5vdbaY5FyNq1vdW3DK1AnhGw+E+7Ze8UDNl9lpARYvqaPar/pVZDZYtQOy1awmYG2g 5k3Niq4WjVh7bmYiSZ8gHJkqCVNNMdF9FOJW5jOC7W1GTqBekgG6lQ9qZJZwEzv/FYuQ DganvQYIFs2J9LCpNq1uKryS4Hwk1SoKcG4TJzYe1xUTtxdSZ//eK2qrznYeMnoAgly7 rr4g== MIME-Version: 1.0 X-Received: by 10.152.29.5 with SMTP id f5mr6458463lah.15.1369315266394; Thu, 23 May 2013 06:21:06 -0700 (PDT) Received: by 10.112.137.101 with HTTP; Thu, 23 May 2013 06:21:06 -0700 (PDT) Date: Thu, 23 May 2013 09:21:06 -0400 Message-ID: Subject: cluster confused...would not delete node until cluster restarted... From: Brian Tarbox To: user@zookeeper.apache.org Content-Type: multipart/alternative; boundary=089e0158c5260a04c504dd628ebb X-Virus-Checked: Checked by ClamAV on apache.org --089e0158c5260a04c504dd628ebb Content-Type: text/plain; charset=ISO-8859-1 My 3 node cluster would not let me delete a node saying it was not empty...but stat showed that it was in fact empty: [zk: localhost:2181(CONNECTED) 1] delete /ROOT_A/INSTANCES/ 10.244.43.240/WORKERS *Node not empty:* /ROOT_A/INSTANCES/10.244.43.240/WORKERS [zk: localhost:2181(CONNECTED) 0] stat /ROOT_A/INSTANCES/ 10.244.43.240/WORKERS cZxid = 0xe0015ad31 ctime = Wed May 22 17:20:52 EDT 2013 mZxid = 0xe0015ad31 mtime = Wed May 22 17:20:52 EDT 2013 pZxid = 0xe0015ae3c cversion = 2 dataVersion = 0 aclVersion = 0 ephemeralOwner = 0x0 dataLength = 24 *numChildren = 0* * * * * The debug log showed this on the machine initiating the delete: 2013-05-23 09:05:37,030 [myid:1] - DEBUG [FollowerRequestProcessor:1:CommitProcessor@171] - Processing request:: sessionid:0x13ed17dc4310000 type:delete cxid:0x3 zxid:0xfffffffffffffffe txntype:unknown reqpath:/ROOT_A/INSTANCES/10.244.43.240/WORKERS 2013-05-23 09:05:37,034 [myid:1] - DEBUG [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:2181:CommitProcessor@161] - Committing request:: sessionid:0x13ed17dc4310000 type:error cxid:0x3 zxid:0xe00200606 txntype:-1 reqpath:n/a 2013-05-23 09:05:37,034 [myid:1] - DEBUG [CommitProcessor:1:FinalRequestProcessor@88] - Processing request:: sessionid:0x13ed17dc4310000 type:delete cxid:0x3 zxid:0xe00200606 txntype:-1 reqpath:/ROOT_A/INSTANCES/10.244.43.240/WORKERS 2013-05-23 09:05:37,034 [myid:1] - DEBUG [CommitProcessor:1:DataTree@949] - *Ignoring processTxn failure hdr: -1 : error: -111* * * And on another node I saw this: 2013-05-23 09:11:22,373 [myid:2] - INFO [ProcessThread(sid:2 cport:-1)::PrepRequestProcessor@627] - Got user-level KeeperException when processing sessionid:0x13ed18295a60000 type:delete cxid:0x2 zxid:0xe0020060b txntype:-1 reqpath:n/a Error Path:/ROOT_A/INSTANCES/ 10.244.43.240/WORKERS Error:KeeperErrorCode = Directory not empty for /ROOT_A/INSTANCES/10.244.43.240/WORKERS* * Third node say nothing in response to the delete. Problem went away after I serially restarted the first two machines so I'm left with a "working" system and an uneasy feeling. Was there anything I could have done other than restart the servers? Thanks. -- http://about.me/BrianTarbox --089e0158c5260a04c504dd628ebb--