From: Mike Panchenko
To: user@cassandra.apache.org
Date: Wed, 25 Jan 2012 21:52:33 -0800
Subject: Re: Restart cassandra every X days?

There are two relevant bugs (that I know of), both resolved in somewhat recent versions, which make somewhat regular restarts beneficial:

https://issues.apache.org/jira/browse/CASSANDRA-2868 (memory leak in GCInspector, fixed in 0.7.9/0.8.5)
https://issues.apache.org/jira/browse/CASSANDRA-2252 (heap fragmentation due to the way memtables used to be allocated, refactored in 1.0.0)

Restarting daily is probably too frequent for either of those problems. We usually notice degraded performance in our ancient cluster after ~2 weeks without a restart.

As Aaron mentioned, if you have plenty of disk space, there's no reason to worry about "cruft" sstables. The size of your active set is what matters, and you can determine whether it's getting too big by watching for iowait (due to reads from the data partition) and/or paging activity of the Java process. When you hit that problem, the solution is to 1. try to tune your caches and 2. add more nodes to spread the load. I'll reiterate: raw disk space usage should not be your guide for that.

"Forcing" a GC generally works, but should not be relied upon (note "suggests" in http://docs.oracle.com/javase/6/docs/api/java/lang/System.html#gc()). It's great news that 1.0 uses a better mechanism for releasing unused sstables.
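As a minimal illustration of watching free space rather than raw usage, something like the sketch below could run from cron. The data directory and threshold are illustrative assumptions, not from this thread:

```shell
#!/bin/sh
# Hypothetical sketch (not from the thread): warn when the Cassandra data
# partition is nearly full. DATA_DIR and THRESHOLD_PCT are assumed defaults.
DATA_DIR=${DATA_DIR:-/var/lib/cassandra/data}
THRESHOLD_PCT=${THRESHOLD_PCT:-10}

# Fall back to / so the sketch can be dry-run on hosts without Cassandra.
[ -d "$DATA_DIR" ] || DATA_DIR=/

# df -P (POSIX mode) prints "Use%" in column 5; derive percent free.
used_pct=$(df -P "$DATA_DIR" | awk 'NR==2 { gsub(/%/, "", $5); print $5 }')
free_pct=$((100 - used_pct))

if [ "$free_pct" -lt "$THRESHOLD_PCT" ]; then
    echo "WARNING: only ${free_pct}% free on $DATA_DIR"
fi
```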
nodetool compact triggers a "major" compaction and is no longer recommended by DataStax (details here: http://www.datastax.com/docs/1.0/operations/tuning#tuning-compaction, bottom of the page).

Hope this helps.

Mike.

On Wed, Jan 25, 2012 at 5:14 PM, aaron morton wrote:
> That disk usage pattern is to be expected in pre-1.0 versions. Disk usage
> is far less interesting than disk free space: if it's using 60GB and there
> is 200GB, that's ok. If it's using 60GB and there is 6MB free, that's a problem.
>
> In pre-1.0, compacted files are deleted on disk by waiting for the JVM
> to decide to GC all remaining references. If there is not enough space on
> disk (to store the total size of the files it is about to write or compact),
> GC is forced and the files are deleted. Otherwise they will get deleted at
> some point in the future.
>
> In 1.0 files are reference counted and space is freed much sooner.
>
> With regard to regular maintenance, nodetool cleanup removes data from a
> node that it is no longer a replica for. This is only of use when you have
> done a token move.
>
> I would not recommend a daily restart of the cassandra process. You will
> lose all the runtime optimizations the JVM has made (I think the mapped
> file pages will stay resident), as well as adding additional entropy to
> the system which must be repaired via HH, RR or nodetool repair.
>
> If you want to see compacted files purged faster the best approach would
> be to upgrade to 1.0.
>
> Hope that helps.
>
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 26/01/2012, at 9:51 AM, R. Verlangen wrote:
>
> In his message he explains that it's for "Forcing a GC". GC stands for
> garbage collection. For some more background see:
> http://en.wikipedia.org/wiki/Garbage_collection_(computer_science)
>
> Cheers!
>
> 2012/1/25 <mike.li@thomsonreuters.com>
>
>> Karl,
>>
>> Can you give a little more details on these 2 lines, what do they do?
>>
>> java -jar cmdline-jmxclient-0.10.3.jar - localhost:8080
>> java.lang:type=Memory gc
>>
>> Thank you,
>> Mike
>>
>> -----Original Message-----
>> From: Karl Hiramoto [mailto:karl@hiramoto.org]
>> Sent: Wednesday, January 25, 2012 12:26 PM
>> To: user@cassandra.apache.org
>> Subject: Re: Restart cassandra every X days?
>>
>> On 01/25/12 19:18, R. Verlangen wrote:
>> > Ok thank you for your feedback. I'll add these tasks to our daily
>> > cassandra maintenance cronjob. Hopefully this will keep things under
>> > control.
>>
>> I forgot to mention that we found that forcing a GC also cleans up some
>> space.
>>
>> In a cronjob you can do this with
>> http://crawler.archive.org/cmdline-jmxclient/
>>
>> My cronjob looks more like:
>>
>> nodetool repair
>> nodetool cleanup
>> nodetool compact
>> java -jar cmdline-jmxclient-0.10.3.jar - localhost:8080
>> java.lang:type=Memory gc
>>
>> --
>> Karl
>>
>> This email was sent to you by Thomson Reuters, the global news and
>> information company. Any views expressed in this message are those of the
>> individual sender, except where the sender specifically states them to be
>> the views of Thomson Reuters.
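Karl's commands could be collected into one script invoked from cron. A sketch under stated assumptions: the script path, jar location, and weekly schedule are invented for illustration, and routine `nodetool compact` is dropped per the advice earlier in the thread:

```shell
#!/bin/sh
# Hypothetical maintenance script (paths and schedule are assumptions,
# not from the thread). Example crontab entry to run it weekly:
#   0 3 * * 0  /usr/local/bin/cassandra-maint.sh
# Routine `nodetool compact` is omitted, per the thread's advice for 1.0+.

JMX_JAR=${JMX_JAR:-/opt/cmdline-jmxclient-0.10.3.jar}

run() {
    # Skip quietly when a tool is missing, so the sketch can be
    # dry-run on hosts without Cassandra installed.
    command -v "$1" >/dev/null 2>&1 || { echo "skipping: $*"; return 0; }
    "$@"
}

run nodetool repair
run nodetool cleanup

# Suggest a JVM GC over JMX; per System.gc() semantics this is only a
# hint the JVM is free to ignore.
if [ -f "$JMX_JAR" ]; then
    java -jar "$JMX_JAR" - localhost:8080 java.lang:type=Memory gc
else
    echo "skipping: jmxclient GC (no $JMX_JAR)"
fi
```

Running repair weekly (rather than daily, as in the original cronjob) also keeps it within the usual gc_grace_seconds window while avoiding constant repair load.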