Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 56189 invoked from network); 10 Mar 2010 08:01:22 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 10 Mar 2010 08:01:22 -0000 Received: (qmail 54092 invoked by uid 500); 10 Mar 2010 08:00:52 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 53982 invoked by uid 500); 10 Mar 2010 08:00:52 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 53974 invoked by uid 99); 10 Mar 2010 08:00:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Mar 2010 08:00:52 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of eprosenx@gmail.com designates 209.85.218.213 as permitted sender) Received: from [209.85.218.213] (HELO mail-bw0-f213.google.com) (209.85.218.213) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Mar 2010 08:00:44 +0000 Received: by bwz5 with SMTP id 5so340722bwz.20 for ; Wed, 10 Mar 2010 00:00:24 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=FiZCV6EqQKV131YpvW0jzIkQVMvPwjWWyoVTG04joIY=; b=PrgZKb47S2lGHW3gU/Un0VWUQrog6pY5FyPUU72r0p6dqXkdBVabkCm6yxrhfip3wb aV3VELMXCcFhB6eY5F0iUTXsKv6WdS27IyFnIcfut0aO56l63WYZrG9oT/5S513KuwK5 Z7cISD1wKDoZyXnqMZ+9zXN85DeqRvopEQtNs= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=K8x59K157yEeXu2dRHOvlU7ijN2VuaTJJzGJo5SMfzKFpYQUmqElctNXGLmdFZu3/x krfKFIqzt8FCT2osuZU4JWn1aJnH457bEIRt/Iwmef7hR/YB7qADCTQL2wDmQ4pvNSkA Xwa0z4yK2OU1Qfq7C5YPaiC4OyUDvmTycC6sw= MIME-Version: 1.0 Received: by 10.204.39.203 with SMTP id h11mr1067837bke.153.1268208023819; Wed, 10 Mar 2010 00:00:23 -0800 (PST) Date: Wed, 10 Mar 2010 00:00:23 -0800 Message-ID: Subject: Effective allocation of multiple disks From: Eric Rosenberry To: cassandra-user@incubator.apache.org Content-Type: multipart/alternative; boundary=000325559052c2d8b404816db081 X-Virus-Checked: Checked by ClamAV on apache.org --000325559052c2d8b404816db081 Content-Type: text/plain; charset=ISO-8859-1 Based on the documentation, it is clear that with Cassandra you want to have one disk for commitlog, and one disk for data. My question is: If you think your workload is going to require more io performance to the data disks than a single disk can handle, how would you recommend effectively utilizing additional disks? It would seem a number of vendors sell 1U boxes with four 3.5 inch disks. If we use one for commitlog, is there a way to have Cassandra itself equally split data across the three remaining disks? Or is this something that needs to be handled by the hardware level, or operating system/file system level? Options include a hardware RAID controller in a RAID 0 stripe (this is more $$$ and for what gain?), or utilizing a volume manager like LVM. Along those same lines, if you do implement some type of striping, what RAID stripe size is recommended? (I think Todd Burruss asked this earlier but I did not see a response) Thanks for any input! -Eric --000325559052c2d8b404816db081 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Based on the documentation, it is clear that with Cassandra you want to hav= e one disk for commitlog, and one disk for data.

My ques= tion is: If you think your workload is going to require more io performance= to the data disks than a single disk can handle, how would you recommend e= ffectively utilizing additional disks?

It would seem a number of vendors sell 1U boxes with fo= ur 3.5 inch disks. =A0If we use one for commitlog, is there a way to have C= assandra itself equally split data across the three remaining disks? =A0Or = is this something that needs to be handled by the hardware level, or operat= ing system/file system level?

Options include a hardware RAID controller in a RAID 0 = stripe (this is more $$$ and for what gain?), or utilizing a volume manager= like LVM.

Along those same lines, if you do imple= ment some type of striping, what RAID stripe size is recommended? =A0(I thi= nk=A0Todd Burruss asked this earlier but I did not see a response)

Thanks for any input!

-Eric
--000325559052c2d8b404816db081--