From: Sebastian Estevez <sebastian.estevez@datastax.com>
Date: Thu, 21 Jan 2016 10:11:49 -0500
Subject: Re: compaction throughput
To: user@cassandra.apache.org

@penguin There have been steady improvements in the different compaction strategies recently, but no major re-writes.

All the best,

Sebastián Estévez
Solutions Architect | 954 905 8615 | sebastian.estevez@datastax.com


On Thu, Jan 21, 2016 at 9:12 AM, Kai Wang <depend@gmail.com> wrote:
I am using 2.2.4 and have seen multiple compactors running on the same table. The number of compactors seems to be controlled by concurrent_compactors. As for types of compactions, I've seen normal compaction and tombstone compaction. Validation and anticompaction seem to always be single-threaded.
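(A quick way to verify this on your own cluster, assuming a 2.x-era nodetool; the exact output layout varies by version:)

    nodetool compactionstats
    # one row per active task; the compaction type column distinguishes
    # Compaction, Validation, Anticompaction, etc., and the keyspace/table
    # columns show how many compactors are working on the same table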

On Thu, Jan 21, 2016 at 8:28 AM, PenguinWhispererThe . <th3penguinwhisperer@gmail.com> wrote:
Thanks for that clarification Sebastian! That's really good to know! I never took increasing this value into consideration because of my previous experience.

In my case I had a table that was compacting over and over... and only one CPU was used. So that made me believe it was not multithreaded (I actually believe I asked this on IRC, but it's been a few months so I might be wrong).

Have there been behavioral changes on this lately? (I was using 2.0.9 or 2.0.11, I believe).

2016-01-21 14:15 GMT+01:00 Sebastian Estevez <sebastian.estevez@datastax.com>:

>So compaction of one table will NOT spread over different cores.

This is not exactly true. You actually can have multiple compactions running at the same time on the same table; it just doesn't happen all that often. You essentially would have to have two sets of sstables that are both eligible for compaction at the same time.

All the best,

Sebastián
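(If you want to catch that happening live — a sketch with standard tools; two simultaneous rows naming the same keyspace and table is exactly the case Sebastián describes:)

    watch -n 2 nodetool compactionstats
    # refresh every 2 seconds and look for two active tasks that
    # list the same keyspace/table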

On Jan 21, 2016 7:41 AM, "PenguinWhispererThe ." <th3penguinwhisperer@gmail.com> wrote:
After having some issues myself with compaction, I think it's noteworthy to explicitly state that compaction of a table can only run on one CPU. So compaction of one table will NOT spread over different cores.
To really make use of concurrent_compactors you need to have multiple table compactions initiated at the same time. If those are small they'll finish much earlier, resulting in only one core running at 100%, as compaction is generally CPU-bound (unless your disks can't keep up).
I believe it's better to be CPU-bound on one core (or at least not all cores) for compaction than disk-IO bound, as the latter would impact the performance of reads and writes.
Compaction is a maintenance task, so it shouldn't be eating all your resources.
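(As a concrete sketch of keeping compaction in check with the knobs this thread already uses — the yaml value applies at startup, the nodetool form adjusts it live; the value is in MB/s and 0 means unthrottled:)

    # cassandra.yaml: cap total compaction write throughput for the node
    compaction_throughput_mb_per_sec: 16

    nodetool setcompactionthroughput 16   # same cap, changed at runtime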


2016-01-16 0:18 GMT+01:00 Kai Wang <depend@gmail.com>:
Jeff & Sebastian,
Thanks for the reply. There are 12 cores, but in my case C* only uses one core most of the time. nodetool compactionstats shows there's only one compactor running. I can see the C* process only uses one core. So I guess I should've asked the question more clearly:
1. Is ~25 MB/s a reasonable compaction throughput for one core?
2. Is there any configuration that affects single-core compaction throughput?
3. Is concurrent_compactors the only option to parallelize compaction? If so, I guess it's the compaction strategy itself that decides when to parallelize and when to block on one core. Then there's not much we can do here.

Thanks.

On Fri, Jan 15, 2016 at 5:23 PM, Jeff Jirsa <jeff.jirsa@crowdstrike.com> wrote:
With SSDs, the typical recommendation is up to 0.8-1 compactor per core (depending on other load). How many CPU cores do you have?
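(Applied to the 12-core node mentioned above, that rule of thumb suggests roughly 9-12 — a sketch of the setting, assuming SSDs and enough spare CPU for normal traffic:)

    # cassandra.yaml: 12 cores x 0.8 = ~10 compactors
    concurrent_compactors: 10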


From: Kai Wang
Reply-To: "user@cassandra.apache.org"
Date: Friday, January 15, 2016 at 12:53 PM
To: "user@cassandra.apache.org"
Subject: compaction throughput

Hi,
I am trying to figure out the bottleneck of compaction on my node. The node is CentOS 7 and has SSDs installed. The table is configured to use LCS. Here are my compaction-related configs in cassandra.yaml:

compaction_throughput_mb_per_sec: 160
concurrent_compactors: 4
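(For completeness, the LCS choice mentioned above is set on the table itself rather than in cassandra.yaml — a sketch with a hypothetical keyspace/table name, using the common 160 MB sstable target:)

    cqlsh -e "ALTER TABLE my_ks.my_table WITH compaction = {'class': 'LeveledCompactionStrategy', 'sstable_size_in_mb': 160}"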

I insert about 10G of data and start observing compaction.

nodetool compactionstats shows most of the time there is one compaction. Sometimes there are 3-4 (I suppose this is controlled by concurrent_compactors). During the compaction, I see one CPU core at 100%. At that point, disk IO is about 20-25 MB/s write, which is much lower than the disk is capable of. Even when there are 4 compactions running, I see CPU go to +400% but disk IO is still at 20-25 MB/s write. I use nodetool setcompactionthroughput 0 to disable the compaction throttle but don't see any difference.
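(Two quick sanity checks for this situation — a sketch using standard tools; top does not show Java thread names, so jstack is used to map the busy thread id to a name:)

    nodetool getcompactionthroughput
    # confirm the throttle really is 0 (unthrottled)
    top -H -p "$(pgrep -f CassandraDaemon)"
    # -H lists per-thread CPU; note the id of a thread pinned near 100%,
    # convert it to hex, and match it against the nid=0x... fields here:
    jstack "$(pgrep -f CassandraDaemon)" | grep CompactionExecutor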

Does this mean compaction is CPU-bound? If so, 20 MB/s is kinda low. Is there any way to improve the throughput?

Thanks.




