Subject: Re: 100% CPU utilization, ParNew and never completing compactions
From: Ryan Svihla <rsvihla@datastax.com>
To: user@cassandra.apache.org
Date: Tue, 16 Dec 2014 15:12:40 -0600

So a heap of that size without some tuning will create a number of problems
(high CPU usage being one of them). I suggest either an 8GB heap with a 400MB
ParNew (which I'd only set that low for a CPU count that low), or attempt the
tunings indicated in https://issues.apache.org/jira/browse/CASSANDRA-8150
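
For anyone following along, those two sizes map onto the standard overrides in
conf/cassandra-env.sh. A minimal sketch, assuming the stock 2.0-era env script
(which expects the two variables to be set as a pair):

    # conf/cassandra-env.sh -- fixed sizes instead of the auto-calculated defaults
    MAX_HEAP_SIZE="8G"    # total JVM heap
    HEAP_NEWSIZE="400M"   # young (ParNew) generation; kept small for a low core count

The node needs a restart to pick these up; the CASSANDRA-8150 ticket linked
above discusses broader GC flag changes beyond these two knobs.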
On Tue, Dec 16, 2014 at 3:06 PM, Arne Claassen wrote:
>
> Changed the 15GB node to 25GB heap and the nice CPU is down to ~20% now.
> Checked my dev cluster to see if the ParNew log entries are just par for
> the course, but I'm not seeing them there. However, both have the following
> every 30 seconds:
>
> DEBUG [BatchlogTasks:1] 2014-12-16 21:00:44,898 BatchlogManager.java (line
> 165) Started replayAllFailedBatches
> DEBUG [MemtablePostFlusher:1] 2014-12-16 21:00:44,899
> ColumnFamilyStore.java (line 866) forceFlush requested but everything is
> clean in batchlog
> DEBUG [BatchlogTasks:1] 2014-12-16 21:00:44,899 BatchlogManager.java (line
> 200) Finished replayAllFailedBatches
>
> Is that just routine scheduled house-keeping or a sign of something else?
>
> On Tue, Dec 16, 2014 at 12:52 PM, Arne Claassen wrote:
>>
>> Sorry, I meant 15GB heap on the one machine that has less nice CPU% now.
>> The others are 6GB.
>>
>> On Tue, Dec 16, 2014 at 12:50 PM, Arne Claassen wrote:
>>>
>>> AWS r3.xlarge, 30GB RAM, but only using a heap of 10GB and a new
>>> generation of 2GB, because we might go c3.2xlarge instead if CPU is more
>>> important than RAM.
>>> Storage is EBS-optimized SSD (but iostat shows no real I/O going on).
>>> Each node only has about 10GB of data, with ownership of 67%, 64.7% and 68.3%.
>>>
>>> On the node where I raised the heap from 6GB to 10GB, utilization has
>>> dropped to 46% nice now, but the ParNew log messages still continue at the
>>> same pace. I'm going to up the heap to 20GB for a bit and see if that brings
>>> the nice CPU further down.
>>>
>>> No TombstoneOverflowingExceptions.
>>>
>>> On Tue, Dec 16, 2014 at 11:50 AM, Ryan Svihla wrote:
>>>>
>>>> What's CPU, RAM, storage layer, and data density per node? Exact heap
>>>> settings would be nice. In the logs, look for TombstoneOverflowingException.
>>>>
>>>> On Tue, Dec 16, 2014 at 1:36 PM, Arne Claassen wrote:
>>>>>
>>>>> I'm running 2.0.10.
>>>>>
>>>>> The data is all time series data and, as we change our pipeline, we've
>>>>> periodically been reprocessing the data sources, which causes each
>>>>> time series to be overwritten, i.e. every row per partition key is deleted
>>>>> and re-written, so I assume I've been collecting a bunch of tombstones.
>>>>>
>>>>> Also, I assumed the ever-present and never-completing compaction
>>>>> types were an artifact of tombstoning, but I fully admit that's
>>>>> conjecture based on the ~20 blog posts and Stack Overflow questions I've
>>>>> surveyed.
>>>>>
>>>>> I doubled the heap on one node and it changed nothing regarding the
>>>>> load or the ParNew log statements. New generation usage is 50%, Eden
>>>>> itself is 56%.
>>>>>
>>>>> If there's anything else I should look at and report, let me know.
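
(An aside on the young-generation numbers quoted above: if a JMX dashboard
isn't handy, the JDK's own jstat shows Eden/old-gen occupancy and ParNew time
from the shell. A minimal sketch, assuming a single Cassandra JVM on the host
and that jstat from the same JDK is on the PATH:

    jstat -gcutil $(pgrep -f CassandraDaemon) 1000
    # E = Eden %, O = old gen %, YGC/YGCT = young-collection count and cumulative seconds

If YGCT grows by a large fraction of a second every second, ParNew really is
where the CPU is going.)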
>>>>>
>>>>> On Tue, Dec 16, 2014 at 11:14 AM, Jonathan Lacefield <
>>>>> jlacefield@datastax.com> wrote:
>>>>>>
>>>>>> Hello,
>>>>>>
>>>>>> What version of Cassandra are you running?
>>>>>>
>>>>>> If it's 2.0, we recently experienced something similar with 8447
>>>>>> [1], which 8485 [2] should hopefully resolve.
>>>>>>
>>>>>> Please note that 8447 is not related to tombstones. Tombstone
>>>>>> processing can put a lot of pressure on the heap as well. Why do you think
>>>>>> you have a lot of tombstones in that one particular table?
>>>>>>
>>>>>> [1] https://issues.apache.org/jira/browse/CASSANDRA-8447
>>>>>> [2] https://issues.apache.org/jira/browse/CASSANDRA-8485
>>>>>>
>>>>>> Jonathan Lacefield
>>>>>> Solution Architect | (404) 822 3487 | jlacefield@datastax.com
>>>>>>
>>>>>> On Tue, Dec 16, 2014 at 2:04 PM, Arne Claassen wrote:
>>>>>>>
>>>>>>> I have a three node cluster that has been sitting at a load of 4
>>>>>>> (for each node) and 100% CPU utilization (although 92% nice) for the
>>>>>>> last 12 hours, ever since some significant writes finished. I'm trying to
>>>>>>> determine what tuning I should be doing to get it out of this state. The
>>>>>>> debug log is just an endless series of:
>>>>>>>
>>>>>>> DEBUG [ScheduledTasks:1] 2014-12-16 19:03:35,042 GCInspector.java
>>>>>>> (line 118) GC for ParNew: 166 ms for 10 collections, 4400928736 used; max
>>>>>>> is 8000634880
>>>>>>> DEBUG [ScheduledTasks:1] 2014-12-16 19:03:36,043 GCInspector.java
>>>>>>> (line 118) GC for ParNew: 165 ms for 10 collections, 4440011176 used; max
>>>>>>> is 8000634880
>>>>>>> DEBUG [ScheduledTasks:1] 2014-12-16 19:03:37,043 GCInspector.java
>>>>>>> (line 118) GC for ParNew: 135 ms for 8 collections, 4402220568 used; max
>>>>>>> is 8000634880
>>>>>>>
>>>>>>> iostat shows virtually no I/O.
>>>>>>>
>>>>>>> Compaction may enter into this, but I don't really know what to make
>>>>>>> of the compaction stats, since they never change:
>>>>>>>
>>>>>>> [root@cassandra-37919c3a ~]# nodetool compactionstats
>>>>>>> pending tasks: 10
>>>>>>>   compaction type   keyspace              table     completed          total   unit   progress
>>>>>>>        Compaction      media   media_tracks_raw     271651482      563615497   bytes     48.20%
>>>>>>>        Compaction      media   media_tracks_raw      30308910    21676695677   bytes      0.14%
>>>>>>>        Compaction      media   media_tracks_raw    1198384080     1815603161   bytes     66.00%
>>>>>>> Active compaction remaining time :   0h22m24s
>>>>>>>
>>>>>>> 5 minutes later:
>>>>>>>
>>>>>>> [root@cassandra-37919c3a ~]# nodetool compactionstats
>>>>>>> pending tasks: 9
>>>>>>>   compaction type   keyspace              table     completed          total   unit   progress
>>>>>>>        Compaction      media   media_tracks_raw     271651482      563615497   bytes     48.20%
>>>>>>>        Compaction      media   media_tracks_raw      30308910    21676695677   bytes      0.14%
>>>>>>>        Compaction      media   media_tracks_raw    1198384080     1815603161   bytes     66.00%
>>>>>>> Active compaction remaining time :   0h22m24s
>>>>>>>
>>>>>>> Sure, the pending tasks went down by one, but the rest is identical.
>>>>>>> media_tracks_raw likely has a bunch of tombstones (I can't figure out
>>>>>>> how to get stats on that).
>>>>>>>
>>>>>>> Is this behavior something that indicates that I need more heap or a
>>>>>>> larger new generation? Should I be manually running compaction on tables
>>>>>>> with lots of tombstones?
>>>>>>>
>>>>>>> Any suggestions or places to educate myself better on performance
>>>>>>> tuning would be appreciated.
>>>>>>>
>>>>>>> arne
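
(On the last two questions above: a rough way to gauge tombstone load per
sstable, and the command for a user-triggered compaction. Treat this as a
sketch; the path assumes the default data directory layout, and sstablemetadata
ships under tools/bin in the tarball distributions:

    # estimated droppable tombstones per sstable for that table
    tools/bin/sstablemetadata /var/lib/cassandra/data/media/media_tracks_raw/*-Data.db | grep -i tombstone
    # force a major compaction of just that table
    nodetool compact media media_tracks_raw

Note that with size-tiered compaction a manual major compaction leaves one very
large sstable behind, which can delay future compactions and tombstone cleanup,
so it's usually a last resort.)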
--
Ryan Svihla
Solution Architect, DataStax