Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 78007 invoked from network); 24 Mar 2011 20:52:54 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 24 Mar 2011 20:52:54 -0000 Received: (qmail 2734 invoked by uid 500); 24 Mar 2011 20:52:52 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 2711 invoked by uid 500); 24 Mar 2011 20:52:52 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 2703 invoked by uid 99); 24 Mar 2011 20:52:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Mar 2011 20:52:52 +0000 X-ASF-Spam-Status: No, hits=0.3 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS,URIBL_RHS_DOB X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a79.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Mar 2011 20:52:46 +0000 Received: from homiemail-a79.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a79.g.dreamhost.com (Postfix) with ESMTP id 2177C7D4061 for ; Thu, 24 Mar 2011 13:52:25 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=content-type :mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; q=dns; s= thelastpickle.com; b=F9W0Gkxt6dQqnJ/Q0u/mXhLX4N2PLsPF/PtH/XxZdQT a9hNgBFC7tGOZWAFE9cNjs7usCZyP/B/BctoiRRp7O+bQDExPxwWdEFA12neEKvV ST18D31ktA/JNjd/ESORpCYE52D9k6y3YCIfsQ9ZzuUBi27dwMNlcNMvLhkaVeVk = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; s= thelastpickle.com; bh=niW3VHgtFULZqTWVY3anf7loOH8=; b=DaJO0x62Rb 5l83AUrTh75AMoFawYKZ2eyd/5+VESZ59DyQhRS4xcqMQbJNtA1t2Nc+wO23ZkNP XMbeJrCgDA6WU3ZzaSDYsTd+oVpNiHZ9rWCT+k7D01fCdMa34sedJCrDy+BAC56j nbtFU/R/aEwy3RKO2ToscmM3UsUs5FGLw= Received: from [10.0.1.159] (121-73-157-230.cable.telstraclear.net [121.73.157.230]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a79.g.dreamhost.com (Postfix) with ESMTPSA id 7EF8B7D4014 for ; Thu, 24 Mar 2011 13:52:24 -0700 (PDT) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1082.1) Subject: Re: Sizing a Cassandra cluster From: aaron morton In-Reply-To: Date: Fri, 25 Mar 2011 09:52:21 +1300 Content-Transfer-Encoding: quoted-printable Message-Id: References: To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1082.1) Big old guess of something in the 1000's.=20 Try benchmarking your work load and plug the numbers (my 5m is pretty = high) in... - 8 cores * 8 writers per core =3D 64 if each write request takes 5ms =3D= 1280 max per sec - 1 spindle * 16 readers per spindle =3D 16 readers if each read request = takes 5ms =3D 320 max per sec (reader and writer sizes from the help in conf/cassandra.yaml) This is really just a guess, there are a lot more things going on in the = system and it gets even more complicated once it's turned on. But I know = sometimes you just need to show you've thought about it :) Hope that helps. Aaron On 25 Mar 2011, at 02:27, Brian Fitzpatrick wrote: > Thanks for the tips on the replication factor. Any thoughts on the > number of nodes in a cluster to support an RF=3D3 with a workload of = 400 > ops/sec (4-8K sized rows, 50/50 read/write)? Based on the "sweet > spot" hardware referenced in the wiki (8-core, 16-32GB RAM), what kink > of ops/sec could I reasonably expect from each node. Just looking for > a range to make some educated guesses. >=20 > Thanks, > Brian >=20 > On Wed, Mar 23, 2011 at 9:04 PM, aaron morton = wrote: >> It really does depend on what your workload is like, and in the end = will >> involve a certain amount of fudge factor. >>=20 >> http://wiki.apache.org/cassandra/CassandraHardware provides some = guidance. >> http://wiki.apache.org/cassandra/MemtableThresholds can be used to = get a >> rough idea of the memory requirements. Note that secondary indexes = are also >> CF's with the same memory settings as the parent. >> With RF3 you can lose afford to lose one replica for a key a token = range and >> still be available (Assuming Quorum CL). With RF 5 you can lose 2 = replicas >> and still be available for the keys in the range. >> I'm been careful to say "lose X replicas" because the other nodes in = the >> cluster don't count when considering an operation for a key. Two = examples, 9 >> node cluster with RF3. If you lose nodes 2 and 3 and they are = replicas for >> node 1, Quorum operations on keys in the range for node 1 will fail = (ranges >> for 2 and 3 will be ok). If you lose nodes 2 and 5 Quorum operations = will >> succeed for all keys. >> RF 3 is reasonable starting point for some redundancy, RF 5 is more. = After >> that it's Web Scale (tm). >> Hope that helps >> Aaron >>=20 >> On 24 Mar 2011, at 04:04, Brian Fitzpatrick wrote: >>=20 >> I'm going through the process of specing out the hardware for a >> Cassandra cluster. The relevant specs: >>=20 >> - Support 460 operations/sec (50/50 read/write workload). Row size >> ranges from 4 to 8K. >> - Support 29 million objects for the first year >> - Support 365 GB storage for the first year, based on Cassandra tests >> (data + index + overhead * replication factor of 3) >>=20 >> I'm looking for advice on the node size for this cluster, recommended >> RAM per node, and whether RF=3D3 seems to be a good choice for = general >> availability and resistance to failure. >>=20 >> I've looked at the YCSB benchmark paper and through the archives of >> this email list looking for pointers. I haven't found any general >> guidelines on recommended cluster size to support X operations/sec >> with Y data size at RF factor of Z, that I could extrapolate from. >>=20 >> Any and all recommendations appreciated. >>=20 >> Thanks, >> Brian >>=20 >>=20