From: Jim Cistaro <jcistaro@netflix.com>
To: "user@cassandra.apache.org" <user@cassandra.apache.org>, Wei Zhu <wz1975@yahoo.com>
Subject: Re: Cassandra pending compaction tasks keeps increasing
Date: Sat, 19 Jan 2013 18:49:18 +0000

1) In addition to iostat, dstat is a good tool to see what kind of disk throughput you are getting. That would be one thing to monitor.
2) For LCS, we also see pending compactions skyrocket. During load, LCS will create a lot of small sstables which will queue up for compaction.
3) For us the biggest concern is not how high the pending count gets, but how often it gets back down near zero. If your load is something you can do in segments or pause, then you can see how fast the cluster recovers on the compactions.
4) One thing which we tune per cluster is the size of the files. Increasing this from 5MB can sometimes improve things, but I forget if we have ever changed this after starting data load. I have sketched the rough commands for 1) and 4) right after this list.
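As a rough sketch of 1) and 4) -- the keyspace/CF names ("ks"/"cf") are placeholders, the dstat flags depend on your dstat version, and the ALTER TABLE form assumes CQL3 on Cassandra 1.2, so adjust for your setup:

    # watch disk throughput/utilization while the migration runs
    iostat -x 5
    dstat -cd 5

    # see how far behind compaction is
    nodetool compactionstats

    # example of raising the LCS sstable size (CQL3 / Cassandra 1.2 syntax)
    ALTER TABLE ks.cf WITH compaction =
      {'class': 'LeveledCompactionStrategy', 'sstable_size_in_mb': 10};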

Is your cluster receiving read traffic during this data migration? If so, I would say that read latency is your best measure. If the high number of SSTables waiting to compact is not hurting your reads, then you are probably ok. Since you are on SSD, there is a good chance the compactions are not hurting you. As for compactionthroughput, we set ours high for SSD. You usually won't use it all because the compactions are usually single threaded. Dstat will help you measure this.
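To put numbers on whether the SSTable backlog is hurting reads, this is roughly what we look at (the keyspace/CF names and the 999 value below are just examples):

    # per-CF read latency and live sstable count
    nodetool cfstats

    # latency percentiles and SSTables touched per read
    nodetool cfhistograms ks cf

    # set the compaction throttle high for SSD (value is in MB/s)
    nodetool setcompactionthroughput 999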

I hope this helps,
jc

From: Wei Zhu <wz1975@yahoo.com>
Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>, Wei Zhu <wz1975@yahoo.com>
Date: Friday, January 18, 2013 12:10 PM
To: Cassandra usergroup <user@cassandra.apache.org>
Subject: Cassandra pending compaction tasks keeps increasing

Hi,
When I run nodetool compactionstats

I see the number of pending tasks keeps going up steadily.

I tried to increase the compaction throughput by using

nodetool setcompactionthroughput

I even went to the extreme of setting it to 0 to disable the throttling.
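Roughly, the commands were (the non-zero value below is just an example):

    nodetool compactionstats              # pending tasks keeps climbing
    nodetool setcompactionthroughput 64   # raised from the 16 MB/s default
    nodetool setcompactionthroughput 0    # 0 disables throttling entirely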

I checked iostat and we have SSD for data; the disk util is less than 5%, which means it's not I/O bound. CPU is also less than 10%.

We are using leveled compaction and are in the process of migrating data. We have 4500 writes per second and very few reads. We have about 70G of data now and will grow to 150G when the migration finishes. We only have one CF, and right now the number of SSTables is around 15000; write latency is still under 0.1ms.

Is there anything to be concerned about? Or anything I can do to reduce the number of pending compactions?

Thanks.
-Wei

