Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of cryptcom@gmail.com designates
 209.85.210.172 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CALamAD+JZcVvRN7YT9dvKJ_FD97LvbJgpCy=kjFOFLn-ZsbmCA@mail.gmail.com>
References: 
 <CANEBM33Hx08PF_DWOqqwXb6mfMghs+44D_4_mLxyQ4YGZ5GQpQ@mail.gmail.com>
	<CALamAD+JZcVvRN7YT9dvKJ_FD97LvbJgpCy=kjFOFLn-ZsbmCA@mail.gmail.com>
Date: Thu, 27 Oct 2011 12:12:46 -0400
Message-ID: 
 <CANEBM30=9_nJT_J9ZTFT_HARpgc4AJ_G5nqS-WJ8pBHhZXdMAQ@mail.gmail.com>
Subject: Re: Counter Experience (Performance)?
From: Joe Stein <cryptcom@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=90e6ba6e83781b985c04b04a0b6d

--90e6ba6e83781b985c04b04a0b6d
Content-Type: text/plain; charset=ISO-8859-1

Thanks Jake. I believe the bottleneck is the disk: each write is taking 50ms,
probably because of EBS (I am testing in EC2).

I will move my testing over to our production network and run it on some
nodes on real hardware, since that is where it will end up.

I am seeing things slow down linearly, with nothing dropping off
precipitously.  Glad to have the benchmarks I already have as a baseline to
compare against.  Thanks!
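
For a sense of scale, here is a rough back-of-envelope sketch of what 50ms
per write implies for the run described in the quoted message below. The
row count, batch shape, and thread count come from that message; treating
"each write" as one whole 1x5x5 batch, and assuming every thread blocks on
one batch at a time, are my assumptions, and the function name is only
illustrative.

```python
# Back-of-envelope throughput estimate. Figures are taken from this thread;
# the blocking, one-batch-per-thread model is an assumption, not a measurement.
ROWS = 6_000_000        # input rows, one batch built per row
CFS, COLS = 5, 5        # 1x5x5: 1 row in each of 5 CFs, 5 columns per row
THREADS = 100           # client thread pool size
WRITE_LATENCY_MS = 50   # observed time per write (assumed to mean per batch)


def estimate(rows, threads, latency_ms, cfs=CFS, cols=COLS):
    """Return (increments per batch, batches per second, total seconds)."""
    increments_per_batch = cfs * cols
    # Each thread completes 1000 / latency_ms batches per second.
    batches_per_sec = threads * 1000 / latency_ms
    total_seconds = rows / batches_per_sec
    return increments_per_batch, batches_per_sec, total_seconds


per_batch, rate, seconds = estimate(ROWS, THREADS, WRITE_LATENCY_MS)
print(f"{per_batch} counter increments per batch")
print(f"{rate:.0f} batches/s, about {seconds / 60:.0f} minutes for {ROWS:,} rows")
```

Under those assumptions latency and thread count are the only levers, so
rerunning the same estimate with the latency measured on real hardware
should show quickly how much of the gap is just EBS.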

On Thu, Oct 27, 2011 at 11:30 AM, Jake Luciani <jakers@gmail.com> wrote:

> What's your bottleneck?
> http://spyced.blogspot.com/2010/01/linux-performance-basics.html
>
>
> On Thu, Oct 27, 2011 at 9:37 AM, Joe Stein <cryptcom@gmail.com> wrote:
>
>> Hey folks, I am interested in what others have seen regarding the depth
>> and width (CFs, rows & columns) that they can/do write per batch and
>> concurrently, and where the inflection point is at which performance
>> degrades.  I have been expanding my use of counters and am finding some
>> interesting nuances, some related to my code and implementation but
>> others I can't yet quantify.
>>
>> My batches are 1x5x5 (1 row in each of 5 column families, and 5 columns
>> in each of those rows).  I have 3 nodes, each with 100 connections, and a
>> separate thread pool of 100 threads rolling through 6,000,000 rows of data
>> and sending them out to Cassandra (the 1x5x5 matrix is constructed from
>> each line).  I am finding this to be my sweet spot right now, but it is
>> still not performing fantastically (or at least not as I had hoped), and I
>> am wondering what else (if anything) I can do to tweak settings so that I
>> can push in more columns or rows.  I find that changing my pool settings
>> very much from this causes errors in the client lib, but I will send an
>> email to that list separately, though I think I have that figured out on
>> my own for now.
>>
>> Thanks in advance!!!  I hope to get more work done on this in the next
>> day or so in a more methodical way to find the right count, so I can build
>> a sparse matrix that will perform best for system and business.
>>
>> /*
>> Joe Stein
>> http://www.linkedin.com/in/charmalloc
>> Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
>> */
>>
>
>
>
> --
> http://twitter.com/tjake
>


-- 

/*
Joe Stein
http://www.linkedin.com/in/charmalloc
Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
*/

--90e6ba6e83781b985c04b04a0b6d--