Return-Path: Delivered-To: apmail-hadoop-zookeeper-user-archive@minotaur.apache.org Received: (qmail 46810 invoked from network); 15 Jul 2010 17:10:56 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Jul 2010 17:10:56 -0000 Received: (qmail 39085 invoked by uid 500); 15 Jul 2010 17:10:55 -0000 Delivered-To: apmail-hadoop-zookeeper-user-archive@hadoop.apache.org Received: (qmail 39019 invoked by uid 500); 15 Jul 2010 17:10:55 -0000 Mailing-List: contact zookeeper-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: zookeeper-user@hadoop.apache.org Delivered-To: mailing list zookeeper-user@hadoop.apache.org Received: (qmail 39011 invoked by uid 99); 15 Jul 2010 17:10:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jul 2010 17:10:54 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of maarten@vrijheid.net designates 69.64.155.201 as permitted sender) Received: from [69.64.155.201] (HELO mail-09.name-services.com) (69.64.155.201) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jul 2010 17:10:46 +0000 Received: from [10.0.1.6] ([62.195.52.198]) by mail-09.name-services.com with Microsoft SMTPSVC(6.0.3790.3959); Thu, 15 Jul 2010 10:10:24 -0700 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1081) Subject: Re: total # of zknodes From: Maarten Koopmans In-Reply-To: <4C3F2FDA.30000@yahoo-inc.com> Date: Thu, 15 Jul 2010 19:10:21 +0200 Content-Transfer-Encoding: quoted-printable Message-Id: <1A817668-C056-41A6-B66B-004D7D9CA622@vrijheid.net> References: <47267DF4-BAC9-48C4-A601-77B151404DC0@vrijheid.net> <4C3F2FDA.30000@yahoo-inc.com> To: zookeeper-user@hadoop.apache.org X-Mailer: Apple Mail (2.1081) X-OriginalArrivalTime: 15 Jul 2010 17:10:24.0943 (UTC) FILETIME=[9D84E3F0:01CB2440] X-Sender: maarten@vrijheid.net X-Virus-Checked: Checked by ClamAV on apache.org Thanks, I see Patrick has replied in the archives but I don't have it in = my mail (yet). I'd probably use 2 EC2 High-mem instances = (17GB/instance), and I have no watches at all, so I should be able to = store between 5-10M data, but I'll test that over the summer. I'll post = the results here (and will publish my simple sync, no-watch Scala client = as well). Best, Maarten Op 15 jul 2010, om 17:57 heeft Benjamin Reed het volgende geschreven: > i think there is a wiki page on this, but for the short answer: >=20 > the number of znodes impact two things: memory footprint and recovery = time. there is a base overhead to znodes to store its path, pointers to = the data, pointers to the acl, etc. i believe that is around 100 bytes. = you cant just divide your memory by 100+1K (for data) though, because = the GC needs to be able to run and collect things and maintain a free = space. if you use 3/4 of your available memory, that would mean with 4G = you can store about three million znodes. when there is a crash and you = recover, servers may need to read this data back off the disk or over = the network. that means it will take about a minute to read 3G from the = disk and perhaps a bit more to read it over the network, so you will = need to adjust your initLimit accordingly. >=20 > of course this is all back-of-the-envelope. i would suggest doing some = quick benchmarks to test and make sure your results are in line with = expectation. >=20 > ben >=20 > On 07/15/2010 02:56 AM, Maarten Koopmans wrote: >> Hi, >>=20 >> I am mapping a filesystem to ZooKeeper, and use it for locking and = mapping a filesystem namespace to a flat data object space (like S3). So = assuming proper nesting and small ZooKeeper nodes (< 1KB), how many = nodes could a cluster with a few GBs of memory per instance = realistically hold totally? >>=20 >> Thanks, Maarten >=20 >=20