Return-Path: X-Original-To: apmail-zookeeper-user-archive@www.apache.org Delivered-To: apmail-zookeeper-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C721FC648 for ; Thu, 24 May 2012 00:41:38 +0000 (UTC) Received: (qmail 64000 invoked by uid 500); 24 May 2012 00:41:37 -0000 Delivered-To: apmail-zookeeper-user-archive@zookeeper.apache.org Received: (qmail 63902 invoked by uid 500); 24 May 2012 00:41:37 -0000 Mailing-List: contact user-help@zookeeper.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@zookeeper.apache.org Delivered-To: mailing list user@zookeeper.apache.org Received: (qmail 63890 invoked by uid 99); 24 May 2012 00:41:37 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 May 2012 00:41:37 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of brian@nimbula.com designates 209.85.212.42 as permitted sender) Received: from [209.85.212.42] (HELO mail-vb0-f42.google.com) (209.85.212.42) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 May 2012 00:41:30 +0000 Received: by vbbfs19 with SMTP id fs19so5473904vbb.15 for ; Wed, 23 May 2012 17:41:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nimbula.com; s=google; h=mime-version:date:message-id:subject:from:to:cc:content-type; bh=H7fzqcx02Yo4PZ+vXJEDz4iKEkAHKCW2EuDcWPg1U0M=; b=dXC6dutq1eJAPH2YBJrMTotGZuqII6b5/wDWq39irEV22R06CZf7iWqF5R3QKt086R 8n0e4oPJ2yevpu+RN/ZuqULKtq1w7z3lw11odPYusWSuFzTHZfvU5jUAF7qErpgz6HQn mfEdy9sJhUwi/dXNzGIp/HF8I0OfGBuvQQsRc= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:date:message-id:subject:from:to:cc:content-type :x-gm-message-state; bh=H7fzqcx02Yo4PZ+vXJEDz4iKEkAHKCW2EuDcWPg1U0M=; b=U1XVEVowSy/CHlz1IFQhyxXYKp3YLfIdl28MrzPZTb5enJ01Q4mpdcRwviTs7ii/Cw UpvB0kzGQZfYlLaHfASNkesHnM+QgmaUaxmAMVENRYhxZ0bnM8GnDL4NTkzhFxIofZ+r xxqVH3pTNX2k92i7BiWJ7oBs9a1Qyyc78Doz9k7GlM+aJ6gWU+RJaLrgi3indHB9+nWi Jmsw5dhKXdA5T+cb7gctNuxPi4FPsh/mark9ZFxbrn8ISUkJWuGZbqlKqwcu5WzbriPA NmIJIDfPgmB/Tmzt2c6ui+lfgICaSSMI5oEmy8oXl2HGTGGOZ5BmTO0lOyQYrIHeSTcI 3zWQ== MIME-Version: 1.0 Received: by 10.220.150.14 with SMTP id w14mr4729400vcv.59.1337820068256; Wed, 23 May 2012 17:41:08 -0700 (PDT) Received: by 10.52.172.227 with HTTP; Wed, 23 May 2012 17:41:08 -0700 (PDT) Date: Wed, 23 May 2012 17:41:08 -0700 Message-ID: Subject: ZooKeeper's resident set size grows but does not shrink From: Brian Oki To: user@zookeeper.apache.org Cc: Brian Oki Content-Type: multipart/alternative; boundary=f46d043c81ccf11a5d04c0bd8152 X-Gm-Message-State: ALoCoQne1g7iVJ+eU7shw4bQcmam+2YXW65jv6yDCC0qCSHohuMA2uyShdwZ1NQs+ZFstFYPWMxZ --f46d043c81ccf11a5d04c0bd8152 Content-Type: text/plain; charset=ISO-8859-1 Hello, We use ZooKeeper 3.3.3. On a 3-node site, we've been using Patrick Hunt's publicly available latencies test suite to create scenarios that will help us to understand the memory, CPU and disk requirements for a deployment of ZooKeeper for our type of workload. We use a fourth node as the ZooKeeper (ZK) client to conduct the tests. We modified zk-latencies.py slightly to just create-set-delete znodes only. In particular, we create 1000 permanent znodes, each written with 250,000 bytes of data. We do this create-set-delete in a loop, sleeping for 5 seconds between iterations. We observe at the ZK leader that the Resident Set Size (RSS) memory climbs rapidly to 2.6 GB on an 8 GB RAM node. The Java heap size of each ZK server daemon is 3 GB. Further, once the test has gone through 15 iterations, all the znodes created on behalf of the test have been deleted. There is no further write activity to ZK, and no read activity at all. The system is quiesced. No other services are competing for the disk, CPU or RAM during the test. Our question is this: The RSS of the ZK leader (and the followers) seems to remain at 2.6 GB after the test has completed. Why? We would expect that since all relevant znodes for the test have been deleted, the leader's RSS should have shrunk considerably, even after 1 hour has passed. Are we missing something? We have used jmap to inspect the heap. To understand the heap contents requires detailed implementation knowledge that we don't have, so we didn't pursue this avenue any further. Configuration: 3 node servers running ZK daemons as 3-server ensemble 1 client machine each node has 8 GB RAM each node has 4 cores each node has a 465 GB disk ZK release: 3.3.3 ZK server java heap size: 3 GB GC: concurrent low-pause garbage collector NIC: bonded 1 Gb NIC Thank you. Sincerely, Brian --f46d043c81ccf11a5d04c0bd8152--