Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 54574 invoked from network); 2 Jun 2010 08:45:22 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 2 Jun 2010 08:45:22 -0000 Received: (qmail 18936 invoked by uid 500); 2 Jun 2010 08:45:21 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 18791 invoked by uid 500); 2 Jun 2010 08:45:21 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 18783 invoked by uid 99); 2 Jun 2010 08:45:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Jun 2010 08:45:20 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of scode@spotify.com designates 209.85.161.44 as permitted sender) Received: from [209.85.161.44] (HELO mail-fx0-f44.google.com) (209.85.161.44) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Jun 2010 08:45:13 +0000 Received: by fxm6 with SMTP id 6so4408411fxm.31 for ; Wed, 02 Jun 2010 01:44:52 -0700 (PDT) MIME-Version: 1.0 Received: by 10.239.174.14 with SMTP id h14mr564719hbf.108.1275468292469; Wed, 02 Jun 2010 01:44:52 -0700 (PDT) Received: by 10.239.152.4 with HTTP; Wed, 2 Jun 2010 01:44:52 -0700 (PDT) In-Reply-To: <1275373111.2481.27.camel@supertool-2907-01> References: <1275373111.2481.27.camel@supertool-2907-01> Date: Wed, 2 Jun 2010 10:44:52 +0200 Message-ID: Subject: Re: writing speed test From: =?UTF-8?Q?Peter_Sch=C3=BCller?= To: user@cassandra.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org Since this thread has now gone on for a while... As far as I can tell you never specify the characteristics of your writes. Evaluating expected write throughput in terms of "MB/s to disk" is pretty impossible if one does not know anything about the nature of the writes. If you're expecting 50 MB, is that reasonable? I don't know; if you're writing a gazillion one-byte values with shortish keys, 50 MB/seconds translates to a *huge* amounts of writes per second and you're likely to be CPU bound even in the most efficient implementation reasonably possible. If on the other hand you're writing large values (say slabs of 128k) you might more reasonably be expecting higher disk throughput. I don't have enough hands-on experience with cassandra to have a feel for the CPU vs. disk in terms of bottlenecking, and when we expect to bottleneck on what, but I can say that it's definitely going to matter quite a lot what *kind* of writes you're doing. This tends to be the case regardless of the database system. -- / Peter Schuller aka scode