Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BB58E7A1F for ; Thu, 3 Nov 2011 22:34:25 +0000 (UTC) Received: (qmail 71507 invoked by uid 500); 3 Nov 2011 22:34:23 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 71474 invoked by uid 500); 3 Nov 2011 22:34:23 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 71466 invoked by uid 99); 3 Nov 2011 22:34:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Nov 2011 22:34:23 +0000 X-ASF-Spam-Status: No, hits=2.4 required=5.0 tests=HTML_FONT_FACE_BAD,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.213.44] (HELO mail-yw0-f44.google.com) (209.85.213.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Nov 2011 22:34:14 +0000 Received: by ywt2 with SMTP id 2so2081488ywt.31 for ; Thu, 03 Nov 2011 15:33:50 -0700 (PDT) MIME-Version: 1.0 Received: by 10.146.181.25 with SMTP id d25mr3068965yaf.3.1320359630286; Thu, 03 Nov 2011 15:33:50 -0700 (PDT) Received: by 10.147.32.5 with HTTP; Thu, 3 Nov 2011 15:33:50 -0700 (PDT) X-Originating-IP: [173.167.104.65] Date: Thu, 3 Nov 2011 15:33:50 -0700 Message-ID: Subject: Read perf investigation From: Ian Danforth To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=000e0cd52b44bd31b004b0dc2ef0 X-Virus-Checked: Checked by ClamAV on apache.org --000e0cd52b44bd31b004b0dc2ef0 Content-Type: text/plain; charset=ISO-8859-1 All, I've done a bit more homework, and I continue to see long 200ms to 300ms read times for some keys. Test Setup EC2 M1Large sending requests to a 5 node C* cluster also in EC2, also all M1Large. RF=3. ReadConsistency = ONE. I'm using pycassa from python for all communication. Data Model One column family with tens of millions of rows. The number of columns per row varies between 0 and 1440 (per minute records). The values are all ints. All data stored on EBS volumes. Total load per node is ~110GB. According to VMstat I'm not swapping at all. Highest %Util I see Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util xvdf 0.00 2788.00 17.00 267.50 1168.00 23020.00 85.02 32.37 107.73 1.22 34.60 A more average profile I see is: Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util xvdf 0.00 0.00 21.00 0.00 1288.00 0.00 61.33 0.37 18.38 9.43 19.80 QUESTION Where should I look next? I'd love to get a profile of exactly where cassandra is spending its time on a per call basis. Thanks in advance, Ian --000e0cd52b44bd31b004b0dc2ef0 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable All,

=A0I've done a bit more homework, and I continu= e to see long 200ms to 300ms read times for some keys.

=
Test Setup

EC2 M1Large sending requests to a = 5 node C* cluster also in EC2, also all M1Large. RF=3D3. ReadConsistency = =3D ONE. I'm using pycassa from python for all communication.

Data Model

One column family w= ith tens of millions of rows. The number of columns per row varies between = 0 and 1440 (per minute records). The values are all ints. All data stored o= n EBS volumes. Total load per node is ~110GB.

According to VMstat I'm not swapping at all.

Highest %Util I see
Device: =A0 =A0 =A0 =A0 rrqm/s = =A0 wrqm/s =A0 =A0 r/s =A0 =A0 w/s =A0 rsec/s =A0 wsec/s avgrq-sz avgqu-sz = =A0 await =A0svctm =A0%util
xvdf =A0 =A0 =A0 =A0 =A0 =A0 =A00.00 =A02788.00 =A0 17.00 =A0267.50 = =A01168.00 23020.00 =A0 =A085.02 =A0 =A032.37 =A0107.73 =A0 1.22 =A034.60

A more average profile I see is:

Device: =A0 =A0 =A0 =A0 rrqm/s =A0 wrqm/s =A0 =A0 r/s =A0 =A0 w/s =A0 rse= c/s =A0 wsec/s avgrq-sz avgqu-sz =A0 await =A0svctm =A0%util
xvdf =A0 =A0 =A0 =A0 =A0 =A0 =A00.00 =A0 =A0 0.00 =A0 21.00 =A0 =A00.0= 0 =A01288.00 =A0 =A0 0.00 =A0 =A061.33 =A0 =A0 0.37 =A0 18.38 =A0 9.43 =A01= 9.80

QUESTION

Where should I look next? I'd love to get a= profile of exactly where cassandra is spending its time on a per call basi= s.

Thanks in advance,

Ian
--000e0cd52b44bd31b004b0dc2ef0--