Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 75684 invoked from network); 16 Feb 2010 18:17:05 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Feb 2010 18:17:05 -0000 Received: (qmail 1939 invoked by uid 500); 16 Feb 2010 18:17:05 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 1895 invoked by uid 500); 16 Feb 2010 18:17:05 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 1886 invoked by uid 99); 16 Feb 2010 18:17:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 18:17:05 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of weijunli@gmail.com designates 209.85.222.172 as permitted sender) Received: from [209.85.222.172] (HELO mail-pz0-f172.google.com) (209.85.222.172) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 18:16:57 +0000 Received: by pzk2 with SMTP id 2so123289pzk.21 for ; Tue, 16 Feb 2010 10:16:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=alvVz6GCqimzSovVY51wUI9+ThkGQCG8fTspGf2H9sU=; b=wM7/5Eyhkm4X3Gls/sZF5AChEirLIOTsoUgDALPXaseGKBnIBDHwi3ki07YH8TAojn 8dcHxrHVqJ9D3PRA0MWrL+YSiTYBUiTqLLJk+BBKzw+0nBIBJ7GlVH2WNrFZcO9lkPqx 17sCncehWchwjF59yOGXimIaYMyk4MDtr0Znc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=pTRh+hXX+11SdB9wcl5LF7fIvRFdlhhGQqCRuN3NvdMBo3jmT+EMmmPWYc2mUfcM8G O3J9Pc+Hs9Cb/rbbRQYZwo82LM/T5Zeym0qGzYO6uRQbG1slXi1lPcPjtPDWy5DVUmUu 0bU1QYeDsvjKZnosakXXMM72BxsvRRaszvZ5A= MIME-Version: 1.0 Received: by 10.115.67.11 with SMTP id u11mr3070484wak.69.1266344195862; Tue, 16 Feb 2010 10:16:35 -0800 (PST) In-Reply-To: References: <468b21171001200244n2521e77esa84964946f0eb20b@mail.gmail.com> <012601caade5$21a79fb0$64f6df10$@com> <005101caaedd$2bb3ce90$831b6bb0$@com> <1NhIrQ-0007Jt-In@mail.eleven.de> Date: Tue, 16 Feb 2010 10:16:35 -0800 Message-ID: Subject: Re: Cassandra benchmark shows OK throughput but high read latency (> 100ms)? From: Weijun Li To: cassandra-user@incubator.apache.org Content-Type: multipart/alternative; boundary=0016e64de750f53325047fbbbbb8 X-Virus-Checked: Checked by ClamAV on apache.org --0016e64de750f53325047fbbbbb8 Content-Type: text/plain; charset=ISO-8859-1 Thanks for for DataFileDirectory trick and I'll give a try. Just noticed the impact of number of data files: node A has 13 data files with read latency of 20ms and node B has 27 files with read latency of 60ms. After I ran "nodeprobe compact" on node B its read latency went up to 150ms. The read latency of node A became as low as 10ms. Is this normal behavior? I'm using random partitioner and the hardware/JVM settings are exactly the same for these two nodes. Another problem is that Java heap usage is always 900mb out of 6GB? Is there any way to utilize all of the heap space to decrease the read latency? -Weijun On Tue, Feb 16, 2010 at 10:01 AM, Brandon Williams wrote: > On Tue, Feb 16, 2010 at 11:56 AM, Weijun Li wrote: > >> One more thoughts about Martin's suggestion: is it possible to put the >> data files into multiple directories that are located in different physical >> disks? This should help to improve the i/o bottleneck issue. >> >> > Yes, you can already do this, just add more directives > pointed at multiple drives. > > >> Has anybody tested the row-caching feature in trunk (shoot for 0.6?)? > > > Row cache and key cache both help tremendously if your read pattern has a > decent repeat rate. Completely random io can only be so fast, however. > > -Brandon > --0016e64de750f53325047fbbbbb8 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Thanks for for DataFileDirectory trick and I'll give a try.

Just= noticed the impact of number of data files: node A has 13 data files with = read latency of 20ms and node B has 27 files with read latency of 60ms. Aft= er I ran "nodeprobe compact" on node B its read latency went up t= o 150ms. The read latency of node A became as low as 10ms. Is this normal b= ehavior? I'm using random partitioner and the hardware/JVM settings are= exactly the same for these two nodes.

Another problem is that Java heap usage is always 900mb out of 6GB? Is = there any way to utilize all of the heap space to decrease the read latency= ?

-Weijun

On Tue, Feb 16, 2010 at = 10:01 AM, Brandon Williams <driftx@gmail.com> wrote:
On Tue, Feb 16, 2010 at 11:56 AM, Weijun Li <= weijunli@gmail.com> wrote:
One more thoughts about Martin's suggestion: is it possible to put the = data files into multiple directories that are located in different physical= disks? This should help to improve the i/o bottleneck issue.


Yes, you can already do this, just add more=A0<= ;DataFileDirectory> directives pointed at multiple drives.
=A0
Has anybody tested the row-caching feature in trunk (shoot for 0.6?)?

Row cache and key cache both help tremendo= usly if your read pattern has a decent repeat rate. =A0Completely random io= can only be so fast, however.

-Brandon=A0

--0016e64de750f53325047fbbbbb8--