Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of mainalimanoj@gmail.com
 designates 209.85.160.44 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAOZAnVNYv8ATG4o6Oy0BFq_hnU8ecyxqfxYUueMeb2Rx4rGYQA@mail.gmail.com>
References: 
 <CAOZAnVMUfUpPz4DeiRbOW_Me1x9nh819acOcsaceNBexYE_beg@mail.gmail.com>
	<CALY91SNzuiWB+tD5C-NvzmtYa76FYJe305T+_xXGW5_b4W_wTA@mail.gmail.com>
	<CAKkz8Q20U3_1L=RrLoiLcNYrdq0Dez5nCgaNGNiE6eRWAYLJow@mail.gmail.com>
	<CAOZAnVOuLPJ9MT0bdwbC2cBs=+Rd5Caj6rybVC3U8uftMH=0Cg@mail.gmail.com>
	<1C1DF018-9710-467B-9180-EC2CE58A6422@thelastpickle.com>
	<CAOZAnVNYv8ATG4o6Oy0BFq_hnU8ecyxqfxYUueMeb2Rx4rGYQA@mail.gmail.com>
Date: Wed, 18 Jul 2012 20:29:55 +0900
Message-ID: 
 <CALGwdJUw64g64X7Ed1uO2rCP6j5LH5Au5mZ-2xB4-udzTjJcmw@mail.gmail.com>
Subject: Re: Cassandra Evaluation/ Benchmarking: Throughput not scaling as
 expected neither latency showing good numbers
From: Manoj Mainali <mainalimanoj@gmail.com>
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Content-Type: multipart/alternative; boundary=e89a8fb2085c7c7e8404c518fb02

--e89a8fb2085c7c7e8404c518fb02
Content-Type: text/plain; charset=ISO-8859-1

What kind of client are you using in YCSB? If you want to improve latency,
try distributing the requests among the nodes instead of stressing a single
node, and try host connection pooling instead of creating a connection for
each request. Look at high-level clients like Hector or Astyanax if you
are not already using them. Some clients have ring-aware request handling.
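
To make the two ideas concrete, here is a minimal sketch of round-robin
request distribution on top of per-host connection pools. It is not the API
of Hector, Astyanax, or YCSB; HostPool, RoundRobinClient, and fake_connect
are hypothetical names, and a "connection" is assumed to be any callable
that takes a request.

```python
import itertools
import queue
import threading


class HostPool:
    """A fixed-size pool of reusable connections to one host."""

    def __init__(self, host, size, connect):
        self.host = host
        self._idle = queue.Queue()
        for _ in range(size):
            # Connections are opened once up front, not once per request.
            self._idle.put(connect(host))

    def acquire(self):
        # Blocks until a pooled connection is free.
        return self._idle.get()

    def release(self, conn):
        self._idle.put(conn)


class RoundRobinClient:
    """Spreads requests over all hosts instead of stressing one node."""

    def __init__(self, hosts, pool_size, connect):
        self._pools = itertools.cycle(
            [HostPool(h, pool_size, connect) for h in hosts]
        )
        self._lock = threading.Lock()

    def execute(self, request):
        with self._lock:
            pool = next(self._pools)  # pick the next host in turn
        conn = pool.acquire()
        try:
            return conn(request)
        finally:
            pool.release(conn)


def fake_connect(host):
    # Stand-in for a real driver connection (assumption for illustration).
    return lambda request: (host, request)


if __name__ == "__main__":
    hosts = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]  # placeholder addresses
    client = RoundRobinClient(hosts, pool_size=2, connect=fake_connect)
    print([client.execute("insert")[0] for _ in range(6)])
```

A ring-aware client goes one step further and sends each request to a node
that owns the key, which this sketch does not attempt.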

You have a three-node cluster with an RF of three, which means every node
gets a copy of the data. What CL are you using for writes? Latency increases
with stronger CLs.
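
As a rough illustration of why a stronger CL costs latency, the sketch below
computes how many replica acknowledgements the coordinator has to wait for
at each level, assuming a single data center. The function name
required_acks is made up for this example.

```python
def required_acks(consistency_level, replication_factor):
    """Replica acknowledgements needed before the write is reported done."""
    levels = {
        "ONE": 1,
        "QUORUM": replication_factor // 2 + 1,  # a majority of replicas
        "ALL": replication_factor,
    }
    return levels[consistency_level]


if __name__ == "__main__":
    for cl in ("ONE", "QUORUM", "ALL"):
        print(cl, required_acks(cl, replication_factor=3))
```

With RF = 3 this gives 1, 2, and 3 acknowledgements. The write is still sent
to all three replicas in every case; the CL only changes how many responses
the request waits for, so the slowest required replica sets the latency.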

If you want to increase throughput, try increasing the number of clients.
Of course, that doesn't mean throughput will always increase. My
observation was that it increases up to a certain number of clients and
then decreases again.
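
One way to find that point is to sweep the client count and record the
throughput at each step. The harness below is only a sketch: do_request is a
placeholder for whatever issues one operation against the cluster, and the
function names and default counts are assumptions, not part of YCSB or the
stress tool.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def measure_throughput(do_request, num_clients, requests_per_client):
    """Run the workload on num_clients threads and return ops/sec."""

    def worker():
        for _ in range(requests_per_client):
            do_request()

    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=num_clients) as pool:
        futures = [pool.submit(worker) for _ in range(num_clients)]
        for future in futures:
            future.result()  # re-raise any exception from a worker
    elapsed = time.perf_counter() - start
    return num_clients * requests_per_client / elapsed


def sweep(do_request, client_counts, requests_per_client=1000):
    """Measure each client count and return (results, best client count)."""
    results = {
        n: measure_throughput(do_request, n, requests_per_client)
        for n in client_counts
    }
    best = max(results, key=results.get)
    return results, best


if __name__ == "__main__":
    # Stand-in request that just sleeps; replace with a real client call.
    results, best = sweep(lambda: time.sleep(0.001), [1, 2, 4, 8], 50)
    for n, ops in results.items():
        print(f"{n} clients: {ops:.0f} ops/sec")
    print("best client count:", best)
```

Against a real cluster, run each step long enough to reach a steady state
and watch the latency percentiles alongside the throughput.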

Regards,
Manoj Mainali


On Wednesday, July 18, 2012, Code Box wrote:

> The cassandra stress tool gives me values around 2.5 milliseconds for
> writing. The problem with the Cassandra stress tool is that it only gives
> average latency numbers, and the averages I am getting are comparable in
> some cases. It is the 95th and 99th percentile numbers that are bad. So it
> means that the slowest 5% of requests are really bad and the rest are good
> enough to pull the average down. I want to make sure that the 95% and 99%
> values are in single-digit milliseconds, because I have seen people getting
> those numbers.
>
> This is my conclusion so far from all the investigations:
>
> A three-node cluster with a replication factor of 3 gets me around 10 ms for
> 100% writes with consistency ONE. The reads are really bad, at around 65 ms.
>
> I thought the network was the issue, so I moved the client onto a local
> machine. A client on the local machine with a one-node cluster again gives
> me good average write latencies, but the 99th and 95th percentiles are bad.
> I am getting around 10 ms for writes and 25 ms for reads.
>
> Network bandwidth between the client and server is 1 Gigabit/second. I was
> able to generate at most 25K requests. So it could be that the client is the
> bottleneck. I am using YCSB. Maybe I should change to some other client.
>
> The maximum throughput I got from a client was 35K locally and 17K
> remotely.
>
>
> I can try these things now:
>
> Use a different client and see what numbers I get for 99% and 95%. I
> am not sure if there is any client that gives me this much detail or if I
> have to write one of my own.
>
> Tweak some hard disk settings (RAID 0 and XFS / ext4) and see if that helps.
>
> It could be that from Cassandra 0.8 to 1.1 the 95% and 99%
> numbers have gone down. The throughput numbers have also gone down.
>
> Is there any other client that I can use besides the Cassandra stress tool
> and YCSB, and are the numbers I have got any good?
>
>
> --Akshat Vig.
>
>
>
>
> On Tue, Jul 17, 2012 at 9:22 PM, aaron morton <aaron@thelastpickle.com> wrote:
>
> I would benchmark a default installation, then start tweaking. That way
> you can see if your changes result in improvements.
>
> To simplify things further try using the tools/stress utility in the
> cassandra source distribution first. It's pretty simple to use.
>
> Add clients until you see the latency increase and tasks start to back up
> in nodetool tpstats. If you see it report dropped messages, the node is
> overloaded.
>
> Hope that helps.
>
>   -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 18/07/2012, at 4:48 AM, Code Box wrote:
>
> Thanks a lot for your reply guys. I was trying fsync = batch and window
> = 0 ms to see if the disk on my drive is fully utilized. I checked the
> numbers using iostat; they were around 60%, and the CPU usage was also
> not too high.
>
> Configuration of my setup:
>
> I have three m1.xlarge hosts, each with 15 GB RAM, 4 CPUs, and 8
> EC2 Compute Units.
> I have kept the replication factor equal to 3. The typical write size is 1
> KB.
>
> I tried adding different nodes each with 200 threads and the throughput
> got split into two. If I do it from a single host with fsync set to
> periodic and a window size of 1000 ms, using two nodes, I am getting
> these numbers:
>
>
> [OVERALL], Throughput(ops/sec), 4771
> [INSERT], AverageLatency(us), 18747
> [INSERT], MinLatency(us), 1470
> [INSERT], MaxLatency(us), 446413
> [INSERT], 95thPercentileLatency(ms), 55
> [INSERT], 99thPercentileLatency(ms), 167
>
> [OVERALL], Throughput(ops/sec), 4678
> [INSERT], AverageLatency(us), 22015
> [INSERT], MinLatency(us), 1439
> [INSERT], MaxLatency(us), 466149
> [INSERT], 95thPercentileLatency(ms), 62
> [INSERT], 99thPercentileLatency(ms), 171
>
> Is there something I am doing wrong in my Cassandra setup? What is the best
> setup for Cassandra to get high throughput and good write latency numbers?
>
>
>
> On Tue, Jul 17, 2012 at 7:02 AM, Sylvain Lebresne <sylvain@datastax.com>
>
>

--e89a8fb2085c7c7e8404c518fb02--