Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=from
	:mime-version:content-type:subject:date:in-reply-to:to
	:references:message-id; q=dns; s=thelastpickle.com; b=IYgX1dAo/f
	igy0UPo6+2Z0MZFxNVU4Gs1+KILr/+aJVwOFHARUJPgwlK7RK39H05VkP1tmkrwg
	2IdjCDygKyHmWy/a5748d5La1hRsEkqc+nevLPgGhws7BVQMnfYEgFJilO2lLE9y
	l7KufacqO2huw3roxm9FWqrDMm/x0Xil4=
From: aaron morton <aaron@thelastpickle.com>
Mime-Version: 1.0 (Apple Message framework v1251.1)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_06EB121E-E524-4950-A54C-083BB113CA20"
Subject: Re: Writes slower then reads
Date: Fri, 6 Jan 2012 08:20:06 +1300
In-Reply-To: 
 <CAHwsXYnfUOW-TRcfejKbjeHVPv+n-yQsKWrCdQLxX-7n=0nj_w@mail.gmail.com>
To: user@cassandra.apache.org
References: 
 <CADVHTB-qJRBjARgnbfxb=NZDPXdzvazMAGe76oOHH=rG6QHBKw@mail.gmail.com>
 <CAHwsXYmTwsDd4v4x1bSvQn65BKefQG=GcBiXu8f10JdMCSehSw@mail.gmail.com>
 <CADVHTB9_hGp2vT=XRO3TON8rc_dXv_bnge2cDhxtxzGR+9y_qw@mail.gmail.com>
 <CADVHTB90NpJ93xHmsndP1NXXoT-LvxuMWRR01Pkvrf-T9kC+zA@mail.gmail.com>
 <CAHwsXYnbbBiHr-Jucwayr5XSUr6aeHfuBheZAhmqT9j8ptb3mg@mail.gmail.com>
 <CADVHTB8fxOdxN5pL-6jEK8G5wEB7Aqm6T4AEhUoEdcF0sEyrJw@mail.gmail.com>
 <CAHwsXY=TdMxumvgVc7bcTUUxtyRFD6oZKAcU=DGEUkky_k96yw@mail.gmail.com>
 <CADVHTB939gkXD2CZ6inxfPoabovQZ9+UA33zBcDKRHzHzw+YJw@mail.gmail.com>
 <CAHwsXYnOhgjM6VOWvLLheWHvNE2Gbfy2yh8Ov1g=inqbO9URFw@mail.gmail.com>
 <CADVHTB9eKvgcg5Smbq9_iCHTY=WK0OPeqTv+U_Y6oQ3X4MP7zg@mail.gmail.com>
 <CAHwsXYnfUOW-TRcfejKbjeHVPv+n-yQsKWrCdQLxX-7n=0nj_w@mail.gmail.com>
Message-Id: <020FFBD7-FAEF-4053-8BDD-424E7DB574A2@thelastpickle.com>


--Apple-Mail=_06EB121E-E524-4950-A54C-083BB113CA20
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

What happens when you turn off the cron jobs ?=20

Cheers

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 6/01/2012, at 6:57 AM, Philippe wrote:

> Unless you are doing huge batches no... don't have any other idea for =
now...
>=20
> 2012/1/5 R. Verlangen <robin@us2.nl>
> The write and read load is very minimal the moment. Roughly 10 writes =
+ 10 reads / second. So 20 operations per second. Don't think that =
overloads my cluster, does it?
>=20
>=20
> 2012/1/5 Philippe <watcherfr@gmail.com>
> You may be overloading the cluster though...
>=20
> My hypothesis is that your traffic is being spread across your node =
and that one slow node is slowing down the fraction of traffic that goes =
to that node (when it's acting as coordinator).
> So what I would do is reduce the read load a lot to make sure I don't =
overload the cluster and measure if I see a 1/RF improvement in response =
time which would validate my hypothesis.
>=20
>=20
> 2012/1/5 R. Verlangen <robin@us2.nl>
>=20
> It does not appear to affect the response time, certainly not in a =
positive way.
>=20
>=20
> 2012/1/5 Philippe <watcherfr@gmail.com>
> What if you shutdown the cassandra service on the slow node, does that =
improve your read performance ?
> If it does then that sole node is responsible for the slow down =
because it can't act as a coordinator fast enough.
>=20
> 2012/1/5 R. Verlangen <robin@us2.nl>
>=20
> I'm also reading with CL =3D ONE
>=20
>=20
> 2012/1/5 Philippe <watcherfr@gmail.com>
> Depending on the CL you're reading at it will yes : if the CL requires =
that the "slow" node create a digest of the data and send it to the =
coordinator then it might explain the poor performance on reads. What is =
your read CL ?
>=20
> 2012/1/5 R. Verlangen <robin@us2.nl>
>=20
> As I posted this I noticed that the other node's CPU is running high =
on some other cronjobs (every couple of minutes to 60% usage). Is the =
lack of more CPU cycles a problem in this case?
>=20
> Robin
>=20
> 2012/1/5 R. Verlangen <robin@us2.nl>
>=20
> CPU is idle (< 10% usage). Disk reads occasionally blocks over 32/64K. =
Writes around 0-5MB per second. Network traffic 0.1 / 0.1 MB/s (in / =
out). Paging 0. System int ~ 1300, csw ~ 2500.
>=20
>=20
> 2012/1/5 Philippe <watcherfr@gmail.com>
> What can you see in vmstat/dstat ?
>=20
> Le 5 janv. 2012 11:58, "R. Verlangen" <robin@us2.nl> a =E9crit :
>=20
> Hi there,
>=20
> I'm running a cassandra 0.8.6 cluster with 2 nodes (in 2 DC's), RF =3D =
2. Actual data on the nodes is only 1GB. Disk latency < 1ms. Disk =
throughput ~ 0.4MB/s. OS load always below 1 (on a 8 core machine with =
16GB ram).=20
>=20
> When I'm running my writes against the cluster with cl =3D ONE all =
reads appear to be faster then the writes.=20
>=20
> Average write speed =3D 1600us/operation
> Average read speed =3D 200us/operation
>=20
> I'm really wondering why this is the case. Anyone got a clue?
>=20
> With kind regards,
> Robin=20
>=20
>=20
>=20
>=20
>=20
>=20
>=20
>=20
>=20


--Apple-Mail=_06EB121E-E524-4950-A54C-083BB113CA20
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=iso-8859-1

<html><head></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">What =
happens when you turn off the cron jobs =
?&nbsp;<div><br></div><div>Cheers</div><div><br><div =
apple-content-edited=3D"true">
<span class=3D"Apple-style-span" style=3D"border-collapse: separate; =
color: rgb(0, 0, 0); font-family: Helvetica; font-style: normal; =
font-variant: normal; font-weight: normal; letter-spacing: normal; =
line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: =
0px; -webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></div></span></div></span></div></span></span>
</div>

<br><div><div>On 6/01/2012, at 6:57 AM, Philippe wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite">Unless you =
are doing huge batches no... don't have any other idea for =
now...<br><br><div class=3D"gmail_quote">2012/1/5 R. Verlangen <span =
dir=3D"ltr">&lt;<a =
href=3D"mailto:robin@us2.nl">robin@us2.nl</a>&gt;</span><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">
The write and read load is very minimal the moment. Roughly 10 writes + =
10 reads / second. So 20 operations per second. Don't think that =
overloads my cluster, does it?<div class=3D"HOEnZb"><div =
class=3D"h5"><br><br><div class=3D"gmail_quote">
2012/1/5 Philippe <span dir=3D"ltr">&lt;<a =
href=3D"mailto:watcherfr@gmail.com" =
target=3D"_blank">watcherfr@gmail.com</a>&gt;</span><br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">You may be overloading =
the cluster though...<div><br><div>My hypothesis is that your traffic is =
being spread across your node and that one slow node is slowing down the =
fraction of traffic that goes to that node (when it's acting as =
coordinator).</div>


<div>So what I would do is reduce the read load a lot to make sure I =
don't overload the cluster and measure if I see a 1/RF improvement in =
response time which would validate my =
hypothesis.</div><div><br></div><div><br>


<div class=3D"gmail_quote">2012/1/5 R. Verlangen <span dir=3D"ltr">&lt;<a =
href=3D"mailto:robin@us2.nl" =
target=3D"_blank">robin@us2.nl</a>&gt;</span><div><div><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">


It does not appear to affect the response time, certainly not in a =
positive way.<div><div><br><br><div class=3D"gmail_quote">2012/1/5 =
Philippe <span dir=3D"ltr">&lt;<a href=3D"mailto:watcherfr@gmail.com" =
target=3D"_blank">watcherfr@gmail.com</a>&gt;</span><br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">What if you shutdown =
the cassandra service on the slow node, does that improve your read =
performance ?<div>If it does then that sole node is responsible for the =
slow down because it can't act as a coordinator fast enough.</div>


<div><br><div class=3D"gmail_quote">2012/1/5 R. Verlangen <span =
dir=3D"ltr">&lt;<a href=3D"mailto:robin@us2.nl" =
target=3D"_blank">robin@us2.nl</a>&gt;</span><div><div><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">


I'm also reading with CL =3D ONE<div><div><br><br><div =
class=3D"gmail_quote">2012/1/5 Philippe <span dir=3D"ltr">&lt;<a =
href=3D"mailto:watcherfr@gmail.com" =
target=3D"_blank">watcherfr@gmail.com</a>&gt;</span><br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
Depending on the CL you're reading at it will yes : if the CL requires =
that the "slow" node create a digest of the data and send it to the =
coordinator then it might explain the poor performance on reads. What is =
your read CL ?<br>


<br><div class=3D"gmail_quote">2012/1/5 R. Verlangen <span =
dir=3D"ltr">&lt;<a href=3D"mailto:robin@us2.nl" =
target=3D"_blank">robin@us2.nl</a>&gt;</span><div><div><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">


As I posted this I noticed that the other node's CPU is running high on =
some other cronjobs (every couple of minutes to 60% usage). Is the lack =
of more CPU cycles a problem in this =
case?<div><br></div><div>Robin<br><br>


<div class=3D"gmail_quote">2012/1/5 R. Verlangen <span dir=3D"ltr">&lt;<a =
href=3D"mailto:robin@us2.nl" =
target=3D"_blank">robin@us2.nl</a>&gt;</span><div><div><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">


CPU is idle (&lt; 10% usage). Disk reads occasionally blocks over =
32/64K. Writes around 0-5MB per second. Network traffic 0.1 / 0.1 MB/s =
(in / out). Paging 0. System int ~ 1300, csw ~ 2500.<div><div>
<br><br><div class=3D"gmail_quote">
2012/1/5 Philippe <span dir=3D"ltr">&lt;<a =
href=3D"mailto:watcherfr@gmail.com" =
target=3D"_blank">watcherfr@gmail.com</a>&gt;</span><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex"><p>What can you see in vmstat/dstat ?</p>

<div class=3D"gmail_quote">Le 5 janv. 2012 11:58, "R. Verlangen" &lt;<a =
href=3D"mailto:robin@us2.nl" target=3D"_blank">robin@us2.nl</a>&gt; a =
=E9crit&nbsp;:<div><div><br type=3D"attribution"><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">


Hi there,<div><br></div><div>I'm running a cassandra 0.8.6 cluster with =
2 nodes (in 2 DC's), RF =3D 2. Actual data on the nodes is only 1GB. =
Disk latency &lt; 1ms. Disk throughput ~ 0.4MB/s. OS load always below 1 =
(on a 8 core machine with 16GB ram).&nbsp;</div>


<div><br></div><div>When I'm running my writes against the cluster with =
cl =3D ONE all reads appear to be faster then the =
writes.&nbsp;</div><div><br></div><div>Average write speed =3D =
1600us/operation</div><div>Average read speed =3D 200us/operation</div>


<div><br></div><div>I'm really wondering why this is the case. Anyone =
got a clue?</div><div><br></div><div>With kind =
regards,</div><div>Robin&nbsp;</div>
</blockquote></div></div></div>
</blockquote></div><br>
</div></div></blockquote></div></div></div><br></div>
</blockquote></div></div></div><br>
</blockquote></div><br>
</div></div></blockquote></div></div></div><br></div>
</blockquote></div><br>
</div></div></blockquote></div></div></div><br></div></div>
</blockquote></div><br>
</div></div></blockquote></div><br>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_06EB121E-E524-4950-A54C-083BB113CA20--