Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 68300 invoked from network); 6 May 2010 21:40:38 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 6 May 2010 21:40:38 -0000 Received: (qmail 55447 invoked by uid 500); 6 May 2010 21:40:37 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 55411 invoked by uid 500); 6 May 2010 21:40:37 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 55403 invoked by uid 99); 6 May 2010 21:40:37 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 May 2010 21:40:37 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbellis@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 May 2010 21:40:30 +0000 Received: by wwb34 with SMTP id 34so101711wwb.31 for ; Thu, 06 May 2010 14:40:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=/FpguiFwWVPUBqizuD/FpkF+/ltm1jWw34xFwOTpgWg=; b=F5g02Qeb5tSMbMAN4UHADZKme+6nQ4uy7yeugAEFPxGYOakCBApp1+LasbhugqFvrN QRaTivnu7yN4HzgNZ/1Fkz2V6UZ5X46FpK5s/y8/xWl8CSLEiwA0pCeZO2XfrbweNB7d r5JL0OazZjujvmBlkDnPpdWQKnd75zqi3sTDw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=Kkq1cLsfrb800vTLFI+1HkJpR7VgwYyU1ykwIf8MkF6LHbzsjM8OPVlOaC3u3zppx0 K0Ii1ScOV+llPjaAYdnhMlvRRj2gnZoX3O0JC7KgkDWmZhOiKREE5oj5mxiHofKvfdCJ ZkY3hC9/+B709rzWslW/2K442/CqbpzIIxi0A= Received: by 10.216.91.76 with SMTP id g54mr4275560wef.2.1273182010623; Thu, 06 May 2010 14:40:10 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.22.10 with HTTP; Thu, 6 May 2010 14:39:50 -0700 (PDT) In-Reply-To: References: <8A606DEA-CB57-4D0B-90C0-FE79B2DE22E9@discovereads.com> From: Jonathan Ellis Date: Thu, 6 May 2010 16:39:50 -0500 Message-ID: Subject: Re: performance tuning - where does the slowness come from? To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Yes, that makes sense. If you never have a warm cache then it's probably disk seek time creating that latency, in which case there isn't a whole lot you can do about it short of adding more capacity (so at least it's cached at the OS level). iostat -x could substantiate this guess. On Thu, May 6, 2010 at 12:56 PM, Ran Tavory wrote: > Jonathan, I think it's the case of large values in the columns. The > problematic CF is a key-value store, so it has only one column per row, > however the value of that column can be large. It's a java serialized obj= ect > (uncompressed) which, may be 100s of bytes, maybe even a few megs. This C= F > also suffers from zero cache hits since each time a read is for a unique > key. > I ran stress.py and I see much better=A0results=A0(reads are < 1ms) so I = assume > my cluster is=A0healthy, so I need to fix the app. Would 1meg bytes objec= t > explain a 30ms (sometimes even more) read latency? The boxes aren't fancy= , > not sure exactly what hardware we have there but it's "commodity"... > Thanks! > > On Thu, May 6, 2010 at 5:22 PM, Jonathan Ellis wrote: >> >> columns, not CFs. >> >> put another way, how wide are the rows in the slow CF? >> >> On Wed, May 5, 2010 at 11:30 PM, Ran Tavory wrote: >> > I have a few CFs but the one I'm seeing slowness in, which is the one >> > with >> > plenty of cache misses has only one column per key. >> > Latency varies b/w 10m and 60ms but I'd say average is 30ms. >> > >> > On Thu, May 6, 2010 at 4:25 AM, Jonathan Ellis >> > wrote: >> >> >> >> How many columns are in the rows you are reading from? >> >> >> >> 30ms is quite high, so I suspect you have relatively large rows, in >> >> which case decreasing the column index threshold may help. >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of Riptano, the source for professional Cassandra support >> http://riptano.com > > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com