Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 40A149E27 for ; Thu, 27 Oct 2011 16:13:15 +0000 (UTC) Received: (qmail 67033 invoked by uid 500); 27 Oct 2011 16:13:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 66995 invoked by uid 500); 27 Oct 2011 16:13:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 66985 invoked by uid 99); 27 Oct 2011 16:13:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2011 16:13:12 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of cryptcom@gmail.com designates 209.85.210.172 as permitted sender) Received: from [209.85.210.172] (HELO mail-iy0-f172.google.com) (209.85.210.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2011 16:13:07 +0000 Received: by iabn5 with SMTP id n5so3910518iab.31 for ; Thu, 27 Oct 2011 09:12:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=aIDD5aPRXWFwqT+JwO+2oCNkh8TPqbt/eHIpf1H+uHI=; b=WqCtJdqUn3wnZPq8C+dx4o2F2B5I+WoCWb6PgV6Xwlnl074iDv/zR9bjdBy91bMZOL utYGTItfhoQq6OgMsZJYAYQ48mEn3G7YKWy98ldSsJfkXLiSoo2zfyYuL3Ui70DOQTKX nlXGaw/M446i/0fy8fnMsCNol78zw6Sn0cCNE= MIME-Version: 1.0 Received: by 10.42.151.196 with SMTP id f4mr59124902icw.17.1319731967268; Thu, 27 Oct 2011 09:12:47 -0700 (PDT) Received: by 10.42.180.197 with HTTP; Thu, 27 Oct 2011 09:12:46 -0700 (PDT) In-Reply-To: References: Date: Thu, 27 Oct 2011 12:12:46 -0400 Message-ID: Subject: Re: Counter Experience (Performance)? From: Joe Stein To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=90e6ba6e83781b985c04b04a0b6d --90e6ba6e83781b985c04b04a0b6d Content-Type: text/plain; charset=ISO-8859-1 Thanks Jake, bottleneck is the disk I believe each write is taking 50ms, EBS probably (doing testing in ec2). I will move my testing over to our production network and run it on some nodes on some real hardware since that where it will end up. I am seeing things slow down linearly and nothing dropping off precipitously. Glad to have the benchmarks I have good to compare things. Thanks! On Thu, Oct 27, 2011 at 11:30 AM, Jake Luciani wrote: > What's your bottleneck? > http://spyced.blogspot.com/2010/01/linux-performance-basics.html > > > On Thu, Oct 27, 2011 at 9:37 AM, Joe Stein wrote: > >> Hey folks, I am interested in what others have seen in regards to their >> experience in the amount of depth and width (CF, Rows & Columns) that they >> can/do write per batch and simultaneously and what is the inflection point >> where performance degrades. I have been expanding my use of counters and >> am finding some interesting nuances some in my code and implementation >> related but others I can't yet quantify. >> >> My batches are 1x5x5 (1 row for each of 5 column families and 5 columns >> for each of those 1 rows within each of the 5 column families). I have 3 >> nodes each with 100 connections and another thread pool of 100 threads >> rolling through 6,000,000 rows off data sending data out to Cassandra (the >> 1x5x5 matrice is constructed from each line). I am finding this to be my >> sweet spot right now but still not really performing fantastically (or at >> least what I had hoped) and I am wondering what else (if anything) I can be >> doing to tweak settings or what to be able to push in more columns or rows. >> I find changing my pool settings very much froms this causes error on >> client lib but I will send email to that list separately though I think I >> have that figured out on my own for now. >> >> Thanks in advance!!! I hope to get more work going on this in the next >> day or so in a more methodic way to find the right count so I can build a >> sparse matrice that will perform best for system and business. >> >> /* >> Joe Stein >> http://www.linkedin.com/in/charmalloc >> Twitter: @allthingshadoop >> */ >> > > > > -- > http://twitter.com/tjake > -- /* Joe Stein http://www.linkedin.com/in/charmalloc Twitter: @allthingshadoop */ --90e6ba6e83781b985c04b04a0b6d Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Thanks Jake,=A0bottleneck=A0is the disk I believe each write is taking 50ms= , EBS probably (doing testing in ec2).

I will move my te= sting over to our production network and run it on some nodes on some real = hardware since that where it will end up.

I am seeing things slow down=A0linearly and nothing dro= pping off=A0precipitously. =A0Glad to have the benchmarks I have good to co= mpare things. =A0Thanks!

On Thu, Oct 27, = 2011 at 11:30 AM, Jake Luciani <jakers@gmail.com> wrote:
What's your bottleneck?=A0http://spyced.blogspot.com/2010/01/linux-performance-basics.html=


On Thu, Oct= 27, 2011 at 9:37 AM, Joe Stein <cryptcom@gmail.com> wrote:=
Hey folks, I am interested in what others ha= ve seen in regards to their experience in the amount of depth and width (CF= , Rows & Columns) that they can/do write per batch and=A0simultaneously= and what is the inflection point where performance degrades. =A0 I have be= en expanding my use of counters and am finding some=A0interesting=A0nuances= some in my code and implementation related but others I can't yet quan= tify.

My batches are 1x5x5 (1 row for each of 5 column families an= d 5 columns for each of those 1 rows within each of the 5 column families).= =A0I have 3 nodes each with 100 connections and another thread pool of 100= threads rolling through 6,000,000 rows off data sending data out to=A0Cass= andra=A0(the 1x5x5 matrice is constructed from each line). =A0I am finding = this to be my sweet spot right now but still not really performing fantasti= cally (or at least what I had hoped) and I am wondering what else (if anyth= ing) I can be doing to=A0tweak=A0settings or what to be able to push in mor= e columns or rows. =A0 I find changing my pool settings very much froms thi= s causes error on client lib but I will send email to that list=A0separatel= y though I think I have that figured out on my own for now.

Thanks in advance!!! =A0I hope to get more work go= ing on this in the next day or so in a more methodic way to find the right = count so I can build a sparse matrice that will perform best for system and= business.

/*
Joe Stein
http://www.linkedin.com/= in/charmalloc
Twitter: @allthingshadoop
*/



--
http://twitter.com/tjake



--

/= *
Joe Stein
http://www.linkedin.com/in/charmalloc
Twitter: @allthingshad= oop
*/
--90e6ba6e83781b985c04b04a0b6d--