Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 14187 invoked from network); 24 May 2010 19:13:10 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 24 May 2010 19:13:10 -0000 Received: (qmail 68169 invoked by uid 500); 24 May 2010 19:13:09 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 68152 invoked by uid 500); 24 May 2010 19:13:09 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 68144 invoked by uid 99); 24 May 2010 19:13:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 May 2010 19:13:09 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of isoboroff@gmail.com designates 209.85.221.192 as permitted sender) Received: from [209.85.221.192] (HELO mail-qy0-f192.google.com) (209.85.221.192) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 May 2010 19:13:02 +0000 Received: by qyk30 with SMTP id 30so5946595qyk.16 for ; Mon, 24 May 2010 12:12:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=5NSFwgow7AcAgDlU2sHbcbL3ICCBxlOd57QsDtp30FU=; b=GCAGTog9soVzrP+jy9E/SrmYfNHeKcZdfK+sUZRmR2HDILOJgIol/BUJa+wdMpep4O hVP2pgRLaW5Y1jvP/unsrQkCfBlwi8cbxvZj4cCIY5VdJIoh726QSTBQpSFYR6iguMZU i7hUxX9Tch46GiLqmiq0+S0vB4jhvjPMHj5Hk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=QJH1kSb17XtIhuOUP1yWSrXiKucD80rXa0jr7z6ETZw+sH8vH/1CJ/3PTPfQaW5FKb E1TZDu8mBcBkZXVweNnukTFAI4YvxvfysJJMPnnPHOpbbxuGQlhEbrXp+CpTEOW5vHaS sQ4dFntFG2g+0FvYjQXiVfTsyRrTw91nwaRQ0= MIME-Version: 1.0 Received: by 10.224.87.75 with SMTP id v11mr3379914qal.397.1274728361638; Mon, 24 May 2010 12:12:41 -0700 (PDT) Received: by 10.229.214.144 with HTTP; Mon, 24 May 2010 12:12:41 -0700 (PDT) In-Reply-To: References: Date: Mon, 24 May 2010 15:12:41 -0400 Message-ID: Subject: Re: Ideal configuration for given hardware From: Ian Soboroff To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=00c09f89980e2e401204875bd368 X-Virus-Checked: Checked by ClamAV on apache.org --00c09f89980e2e401204875bd368 Content-Type: text/plain; charset=ISO-8859-1 My data disks on two of my nodes are RAID-5, just because of circumstances. My other nodes are JBOD. I don't notice any real difference, but I haven't strongly benched it. Ian On Mon, May 24, 2010 at 2:45 PM, Jonathan Ellis wrote: > I can think of at least 2 clusters running 32GB boxes with single > Cassandra processes on each. (16 seems to be more common.) At 64 I > would seriously consider multiple processes per machine. You'd want > to configure a Snitch such that same-machine boxes were considered the > same rack, there is no separate closeness level of same machine. > > At 32 I think you're fine with one process. Watch for latency spikes > and see how it goes. > > I would run raid 10 on the data disks if you can afford giving up the > space, otherwise raid0. I don't know that anyone's tested raid5. > > On Sun, May 23, 2010 at 3:30 PM, Aaron McCurry wrote: > > I am planning on setting up a Cassandra cluster on a small 16 node > cluster > > (possibly 32 way). Each machine has 8 cores 32 Gig of ram and 8 hds. My > > first thought is to setup one of those hds for the commit log, 6 for data > > and leave one for the OS. However I do have a concern about best > utilizing > > my memory, should I run a larger heap? Should I run several cassandra > > processes on the same box? > > My concern about the larger heap is because GC's typically get slower. > And > > if I run several procs, does cassandra realize that it's the same box for > > replication purposes? > > I do have other hd conf options, hardware RAID 0,1,or 5. > > Just looking for some general configuration options as well as some real > > world successes with similarly sized hardware. Thanks! > > Aaron > > > > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of Riptano, the source for professional Cassandra support > http://riptano.com > --00c09f89980e2e401204875bd368 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable My data disks on two of my nodes are RAID-5, just because of circumstances.= =A0 My other nodes are JBOD.=A0 I don't notice any real difference, but= I haven't strongly benched it.
Ian

On Mon, May 24, 2010 at 2:45 PM, Jonathan Ellis <jbellis@gmail.com> wrote:
I can think of at= least 2 clusters running 32GB boxes with single
Cassandra processes on each. =A0(16 seems to be more common.) =A0At 64 I would seriously consider multiple processes per machine. =A0You'd want<= br> to configure a Snitch such that same-machine boxes were considered the
same rack, there is no separate closeness level of same machine.

At 32 I think you're fine with one process. =A0Watch for latency spikes=
and see how it goes.

I would run raid 10 on the data disks if you can afford giving up the
space, otherwise raid0. =A0I don't know that anyone's tested raid5.=

On Sun, May 23, 2010 at 3:30 PM, Aaron McCurry <amccurry@gmail.com> wrote:
> I am planning on setting up a Cassandra cluster on a small 16 node clu= ster
> (possibly 32 way). =A0Each machine has 8 cores 32 Gig of ram and 8 hds= . =A0My
> first thought is to setup one of those hds for the commit log, 6 for d= ata
> and leave one for the OS. =A0However I do have a concern about best ut= ilizing
> my memory, should I run a larger heap? =A0Should I run several cassand= ra
> processes on the same box?
> My concern about the larger heap is because GC's typically get slo= wer. =A0And
> if I run several procs, does cassandra realize that it's the same = box for
> replication=A0purposes?
> I do have other hd conf options, hardware RAID 0,1,or 5.
> Just looking for some general configuration options as well as some re= al
> world successes with similarly sized hardware. =A0Thanks!
> Aaron



--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com

--00c09f89980e2e401204875bd368--