Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=from
	:mime-version:content-type:subject:date:in-reply-to:to
	:references:message-id; q=dns; s=thelastpickle.com; b=htjCSd7GyI
	HialaPUri70unCE8k/pOkAATHvbVxNokE1bDpijlCBDiwE7+VMs87+AJXKX4bu1K
	jk3r+kaZnBziPgIP6f60PpJEvpVEY9XVw8bqqPvS7hnF7UsfmvmfHaKb0hfR3fX/
	vYD/RqWjnNe0WYo8yZvgpvl8BLpEXtI+k=
From: aaron morton <aaron@thelastpickle.com>
Mime-Version: 1.0 (Apple Message framework v1251.1)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_54F1207E-4BDD-463F-867C-12585E13F335"
Subject: Re: Restart cassandra every X days?
Date: Fri, 3 Feb 2012 08:44:52 +1300
In-Reply-To: 
 <CADVHTB-FoC9r0eqV=PYnrzg_PbQYsYUSUKGdNky8Z9oFWWQXPA@mail.gmail.com>
To: user@cassandra.apache.org
References: 
 <CADVHTB-7zh+7-qWQE=QwMHZFm28cuhJMXQ=uL8gmrUvJOQqxfA@mail.gmail.com>
 <4F20464B.9060308@hiramoto.org>
 <CADVHTB_Fs9uRxR6ymxL_ztX_NtV6zK4mBD3UUDDbj7HEGgwxuA@mail.gmail.com>
 <4F20491F.3070304@hiramoto.org>
 <6F1D54DD1FE19B408496215469822A8701F060B376D3@OCDP-ERFMMBX03.ERF.THOMSON.COM>
 <CADVHTB8NBYj5N8tm=70-Xcmo9yBe+yqk068yMj2wx2Fnza-+yA@mail.gmail.com>
 <30C7D8E3-3C2E-40C5-8629-302DEBDC480A@thelastpickle.com>
 <CADAt1aiJLuYrCdMRa0Xt6iWhT1didUVQTQaird6+H6meWMhBJA@mail.gmail.com>
 <CAENxBwxZX3kVa1CkBReSH0vf_1UVbbsY=0LeRG9CWW3p8shtaw@mail.gmail.com>
 <CADVHTB-+xiNAFak3y+qdGtgbVXmo=W9f_oxRH8Of6y4iiJ3o3A@mail.gmail.com>
 <0A794C24-16F6-41AE-9ADA-F9FB0BA5EFB4@thelastpickle.com>
 <CADVHTB9VYd+J7L5fcm9ma-5mYnaU7H0BEnf9JFnpSMEP5z0XFA@mail.gmail.com>
 <4F2410BF.2050803@bnl.gov>
 <4D115C38-6CA7-4508-AEBB-42C11F715BC2@thelastpickle.com>
 <CADVHTB9oLifxBnfQwThum_15JuW8V909BrEwP4D3yU1NBcLp5w@mail.gmail.com>
 <947E79FB-ACC9-4384-AD7D-2AFBCB66E5A9@thelastpickle.com>
 <CADVHTB-FoC9r0eqV=PYnrzg_PbQ
 YsYUSUKGdNky8Z9oFWWQXPA@mail.gmail.com>
Message-Id: <A6D03018-1383-4DB2-8BCD-5AFDED299D71@thelastpickle.com>


--Apple-Mail=_54F1207E-4BDD-463F-867C-12585E13F335
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=windows-1252

Speaking technically, that ain't right.

I would:
* Check if node .135 is holding a lot of hints.=20
* Take a look on disk and see what is there.
* Go through a repair and compact on each node.   =20

Cheers

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 2/02/2012, at 9:55 PM, R. Verlangen wrote:

> Yes, I already did a repair and cleanup. Currently my ring looks like =
this:
>=20
> Address         DC          Rack        Status State   Load            =
Owns    Token
> ***.89    datacenter1 rack1       Up     Normal  2.44 GB         =
50.00%  0
> ***.135    datacenter1 rack1       Up     Normal  6.99 GB         =
50.00%  85070591730234615865843651857942052864
>=20
> It's not really a problem, but I'm still wondering why this happens.
>=20
> 2012/2/1 aaron morton <aaron@thelastpickle.com>
> Do you mean the load in nodetool ring is not even, despite the tokens =
been evenly distributed ?=20
>=20
> I would assume this is not the case given the difference, but it may =
be hints given you have just done an upgrade. Check the system using =
nodetool cfstats to see. They will eventually be delivered and deleted.=20=

>=20
> More likely you will want to:
> 1) nodetool repair to make sure all data is distributed then
> 2) nodetool cleanup if you have changed the tokens at any point =
finally
>=20
> Cheers
>=20
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>=20
> On 31/01/2012, at 11:56 PM, R. Verlangen wrote:
>=20
>> After running 3 days on Cassandra 1.0.7 it seems the problem has been =
solved. One weird thing remains, on our 2 nodes (both 50% of the ring), =
the first's usage is just over 25% of the second.=20
>>=20
>> Anyone got an explanation for that?
>>=20
>> 2012/1/29 aaron morton <aaron@thelastpickle.com>
>> Yes but=85
>>=20
>> For every upgrade read the NEWS.TXT it will go through the upgrade =
procedure in detail. If you want to feel extra smart scan through the =
CHANGES.txt to get an idea of whats going on.=20
>>=20
>> Cheers
>>=20
>> -----------------
>> Aaron Morton
>> Freelance Developer
>> @aaronmorton
>> http://www.thelastpickle.com
>>=20
>> On 29/01/2012, at 4:14 AM, Maxim Potekhin wrote:
>>=20
>>> Sorry if this has been covered, I was concentrating solely on 0.8x =
--
>>> can I just d/l 1.0.x and continue using same data on same cluster?
>>>=20
>>> Maxim
>>>=20
>>>=20
>>> On 1/28/2012 7:53 AM, R. Verlangen wrote:
>>>>=20
>>>> Ok, seems that it's clear what I should do next ;-)
>>>>=20
>>>> 2012/1/28 aaron morton <aaron@thelastpickle.com>
>>>> There are no blockers to upgrading to 1.0.X.
>>>>=20
>>>> A=20
>>>> -----------------
>>>> Aaron Morton
>>>> Freelance Developer
>>>> @aaronmorton
>>>> http://www.thelastpickle.com
>>>>=20
>>>> On 28/01/2012, at 7:48 AM, R. Verlangen wrote:
>>>>=20
>>>>> Ok. Seems that an upgrade might fix these problems. Is Cassandra =
1.x.x stable enough to upgrade for, or should we wait for a couple of =
weeks?
>>>>>=20
>>>>> 2012/1/27 Edward Capriolo <edlinuxguru@gmail.com>
>>>>> I would not say that issuing restart after x days is a good idea. =
You are mostly developing a superstition. You should find the source of =
the problem. It could be jmx or thrift clients not closing connections. =
We don't restart nodes on a regiment they work fine.
>>>>>=20
>>>>>=20
>>>>> On Thursday, January 26, 2012, Mike Panchenko <m@mihasya.com> =
wrote:
>>>>> > There are two relevant bugs (that I know of), both resolved in =
somewhat recent versions, which make somewhat regular restarts =
beneficial
>>>>> > https://issues.apache.org/jira/browse/CASSANDRA-2868 (memory =
leak in GCInspector, fixed in 0.7.9/0.8.5)
>>>>> > https://issues.apache.org/jira/browse/CASSANDRA-2252 (heap =
fragmentation due to the way memtables used to be allocated, refactored =
in 1.0.0)
>>>>> > Restarting daily is probably too frequent for either one of =
those problems. We usually notice degraded performance in our ancient =
cluster after ~2 weeks w/o a restart.
>>>>> > As Aaron mentioned, if you have plenty of disk space, there's no =
reason to worry about "cruft" sstables. The size of your active set is =
what matters, and you can determine if that's getting too big by =
watching for iowait (due to reads from the data partition) and/or paging =
activity of the java process. When you hit that problem, the solution is =
to 1. try to tune your caches and 2. add more nodes to spread the load. =
I'll reiterate - looking at raw disk space usage should not be your =
guide for that.
>>>>> > "Forcing" a gc generally works, but should not be relied upon =
(note "suggest" in =
http://docs.oracle.com/javase/6/docs/api/java/lang/System.html#gc()). =
It's great news that 1.0 uses a better mechanism for releasing unused =
sstables.
>>>>> > nodetool compact triggers a "major" compaction and is no longer =
a recommended by datastax (details here =
http://www.datastax.com/docs/1.0/operations/tuning#tuning-compaction =
bottom of the page).
>>>>> > Hope this helps.
>>>>> > Mike.
>>>>> > On Wed, Jan 25, 2012 at 5:14 PM, aaron morton =
<aaron@thelastpickle.com> wrote:
>>>>> >
>>>>> > That disk usage pattern is to be expected in pre 1.0 versions. =
Disk usage is far less interesting than disk free space, if it's using =
60 GB and there is 200GB thats ok. If it's using 60Gb and there is 6MB =
free thats a problem.
>>>>> > In pre 1.0 the compacted files are deleted on disk by waiting =
for the JVM do decide to GC all remaining references. If there is not =
enough space (to store the total size of the files it is about to write =
or compact) on disk GC is forced and the files are deleted. Otherwise =
they will get deleted at some point in the future.=20
>>>>> > In 1.0 files are reference counted and space is freed much =
sooner.=20
>>>>> > With regard to regular maintenance, node tool cleanup remvos =
data from a node that it is no longer a replica for. This is only of use =
when you have done a token move.=20
>>>>> > I would not recommend a daily restart of the cassandra process. =
You will lose all the run time optimizations the JVM has made (i think =
the mapped files pages will stay resident). As well as adding additional =
entropy to the system which must be repaired via HH, RR or nodetool =
repair.=20
>>>>> > If you want to see compacted files purged faster the best =
approach would be to upgrade to 1.0.=20
>>>>> > Hope that helps.=20
>>>>> > -----------------
>>>>> > Aaron Morton
>>>>> > Freelance Developer
>>>>> > @aaronmorton
>>>>> > http://www.thelastpickle.com
>>>>> > On 26/01/2012, at 9:51 AM, R. Verlangen wrote:
>>>>> >
>>>>> > In his message he explains that it's for " Forcing a GC ". GC =
stands for garbage collection. For some more background see:  =
http://en.wikipedia.org/wiki/Garbage_collection_(computer_science)=20
>>>>> > Cheers!
>>>>> >
>>>>> > 2012/1/25 <mike.li@thomsonreuters.com>
>>>>> >
>>>>> > Karl,
>>>>> >
>>>>> > Can you give a little more details on these 2 lines, what do =
they do?
>>>>> >
>>>>> > java -jar cmdline-jmxclient-0.10.3.jar - localhost:8080
>>>>> > java.lang:type=3DMemory gc
>>>>> >
>>>>> > Thank you,
>>>>> > Mike
>>>>> >
>>>>> > -----Original Message-----
>>>>> > From: Karl Hiramoto [mailto:karl@hiramoto.org]
>>>>> > Sent: Wednesday, January 25, 2012 12:26 PM
>>>>> > To: user@cassandra.apache.org
>>>>> > Subject: Re: Restart cassandra every X days?
>>>>> >
>>>>> >
>>>>> > On 01/25/12 19:18, R. Verlangen wrote:
>>>>> >> Ok thank you for your feedback. I'll add these tasks to our =
daily
>>>>> >> cassandra maintenance cronjob. Hopefully this will keep things =
under
>>>>> >> controll.
>>>>> >
>>>>> > I forgot to mention that we found that Forcing a GC also cleans =
up some
>>>>> > space.
>>>>> >
>>>>> >
>>>>> > in a cronjob you can do this with
>>>>> > http://crawler.archive.org/cmdline-jmxclient/
>>>>> >
>>>>> >
>>>>> > my cron
>>>>>=20
>>>>=20
>>>>=20
>>>=20
>>=20
>>=20
>=20
>=20


--Apple-Mail=_54F1207E-4BDD-463F-867C-12585E13F335
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=windows-1252

<html><head></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
">Speaking technically, that ain't right.<div><br></div><div>I =
would:</div><div>* Check if node .135 is holding a lot of =
hints.&nbsp;</div><div>* Take a look on disk and see what is =
there.</div><div>* Go through a repair and compact on each node. &nbsp; =
&nbsp;<br><div><br></div><div>Cheers</div><div><br></div><div =
apple-content-edited=3D"true">
<span class=3D"Apple-style-span" style=3D"border-collapse: separate; =
color: rgb(0, 0, 0); font-family: Helvetica; font-style: normal; =
font-variant: normal; font-weight: normal; letter-spacing: normal; =
line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: =
0px; -webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></div></span></div></span></div></span></span>
</div>

<br><div><div>On 2/02/2012, at 9:55 PM, R. Verlangen wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite">Yes, I =
already did a repair and cleanup. Currently my ring looks like =
this:<div><br></div><div><div>Address &nbsp; &nbsp; &nbsp; &nbsp; DC =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Rack &nbsp; &nbsp; &nbsp; &nbsp;Status =
State &nbsp; Load &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Owns &nbsp; =
&nbsp;Token</div><div>***.89 &nbsp; &nbsp;datacenter1 rack1 &nbsp; =
&nbsp; &nbsp; Up &nbsp; &nbsp; Normal &nbsp;2.44 GB &nbsp; &nbsp; &nbsp; =
&nbsp; 50.00% &nbsp;0</div>
<div>***.135 &nbsp; &nbsp;datacenter1 rack1 &nbsp; &nbsp; &nbsp; Up =
&nbsp; &nbsp; Normal &nbsp;6.99 GB &nbsp; &nbsp; &nbsp; &nbsp; 50.00% =
&nbsp;85070591730234615865843651857942052864</div><div><br></div><div>It's=
 not really a problem, but I'm still wondering why this happens.</div>
<br><div class=3D"gmail_quote">2012/2/1 aaron morton <span =
dir=3D"ltr">&lt;<a =
href=3D"mailto:aaron@thelastpickle.com">aaron@thelastpickle.com</a>&gt;</s=
pan><br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style=3D"word-wrap:break-word">Do you mean the load in nodetool =
ring is not even, despite the tokens been evenly distributed =
?&nbsp;<div><br></div><div>I would assume this is not the case given the =
difference, but it may be hints given you have just done an upgrade. =
Check the system using nodetool cfstats to see. They will eventually be =
delivered and deleted.&nbsp;</div>
<div><br></div><div>More likely you will want to:</div><div>1) nodetool =
repair to make sure all data is distributed then</div><div>2) nodetool =
cleanup if you have changed the tokens at any point =
finally</div><div><br></div>
<div>Cheers</div><div><br></div><div><div class=3D"im"><div>
<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;text-al=
ign:-webkit-auto;font-style:normal;font-weight:normal;line-height:normal;b=
order-collapse:separate;text-transform:none;font-size:medium;white-space:n=
ormal;font-family:Helvetica;word-spacing:0px"><span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">
<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">
<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">
<div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a></div></div></div></span=
></div>
</span></div></span></span>
</div>

<br></div><div><div class=3D"h5"><div><div>On 31/01/2012, at 11:56 PM, =
R. Verlangen wrote:</div><br><blockquote type=3D"cite">After running 3 =
days on Cassandra 1.0.7 it seems the problem has been solved. One weird =
thing remains, on our 2 nodes (both 50% of the ring), the first's usage =
is just over 25% of the second.&nbsp;<div>
<br></div><div>Anyone got an explanation for that?</div>
<div><br><div class=3D"gmail_quote">2012/1/29 aaron morton <span =
dir=3D"ltr">&lt;<a href=3D"mailto:aaron@thelastpickle.com" =
target=3D"_blank">aaron@thelastpickle.com</a>&gt;</span><br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">

<div style=3D"word-wrap:break-word">Yes but=85<div><br></div><div>For =
every upgrade read the NEWS.TXT it will go through the upgrade procedure =
in detail. If you want to feel extra smart scan through the CHANGES.txt =
to get an idea of whats going on.&nbsp;</div>

<div><br></div><div>Cheers</div><div><div><br><div>
<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;text-al=
ign:-webkit-auto;font-style:normal;font-weight:normal;line-height:normal;b=
order-collapse:separate;text-transform:none;font-size:medium;white-space:n=
ormal;font-family:Helvetica;word-spacing:0px"><span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">

<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">

<span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px"><div style=3D"word-wrap:break-word">

<div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a></div></div></div>
</span></div>
</span></div></span></span>
</div>

<br></div><div><div><div><div>On 29/01/2012, at 4:14 AM, Maxim Potekhin =
wrote:</div><br><blockquote type=3D"cite">
 =20
   =20
 =20
  <div bgcolor=3D"#FFFFFF" text=3D"#000000">
    Sorry if this has been covered, I was concentrating solely on 0.8x
    --<br>
    can I just d/l 1.0.x and continue using same data on same =
cluster?<br>
    <br>
    Maxim<br>
    <br>
    <br>
    On 1/28/2012 7:53 AM, R. Verlangen wrote:
    <blockquote type=3D"cite">Ok, seems that it's clear what I should do =
next ;-)<br>
      <br>
      <div class=3D"gmail_quote">2012/1/28 aaron morton <span =
dir=3D"ltr">&lt;<a href=3D"mailto:aaron@thelastpickle.com" =
target=3D"_blank">aaron@thelastpickle.com</a>&gt;</span><br>
        <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
          <div style=3D"word-wrap:break-word">There are no blockers to
            upgrading to 1.0.X.<span><font color=3D"#888888">
                <div><br>
                </div>
              </font></span>
            <div><span><font color=3D"#888888">A&nbsp;</font></span>
              <div>
                <div>
                  <div>
                    <span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;text-al=
ign:-webkit-auto;font-style:normal;font-weight:normal;line-height:normal;b=
order-collapse:separate;text-transform:none;font-size:medium;white-space:n=
ormal;font-family:Helvetica;word-spacing:0px"><span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px">
                        <div style=3D"word-wrap:break-word">
                          <span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px">
                            <div style=3D"word-wrap:break-word">
                              <span =
style=3D"text-indent:0px;letter-spacing:normal;font-variant:normal;font-st=
yle:normal;font-weight:normal;line-height:normal;border-collapse:separate;=
text-transform:none;font-size:medium;white-space:normal;font-family:Helvet=
ica;word-spacing:0px">
                                <div style=3D"word-wrap:break-word">
                                  <div>
                                    <div>-----------------</div>
                                    <div>Aaron Morton</div>
                                    <div>Freelance Developer</div>
                                    <div>@aaronmorton</div>
                                    <div><a =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a></div>
                                  </div>
                                </div>
                              </span></div>
                          </span></div>
                      </span></span>
                  </div>
                  <br>
                </div>
                <div>
                  <div>
                    <div>
                      <div>On 28/01/2012, at 7:48 AM, R. Verlangen
                        wrote:</div>
                      <br>
                      <blockquote type=3D"cite">Ok. Seems that an =
upgrade
                        might fix these problems. Is Cassandra 1.x.x
                        stable enough to upgrade for, or should we wait
                        for a couple of weeks?
                        <div>
                          <br>
                          <div class=3D"gmail_quote">2012/1/27 Edward
                            Capriolo <span dir=3D"ltr">&lt;<a =
href=3D"mailto:edlinuxguru@gmail.com" =
target=3D"_blank">edlinuxguru@gmail.com</a>&gt;</span><br>
                            <blockquote class=3D"gmail_quote" =
style=3D"margin:0 0 0 .8ex;border-left:1px #ccc =
solid;padding-left:1ex">I would not
                              say that issuing restart after x days is a
                              good idea. You are mostly developing a
                              superstition. You should find the source
                              of the problem. It could be jmx or thrift
                              clients not closing connections. We don't
                              restart nodes on a regiment they work
                              fine.
                              <div>
                                <div><br>
                                  <br>
                                  On Thursday, January 26, 2012, Mike
                                  Panchenko &lt;<a =
href=3D"mailto:m@mihasya.com" target=3D"_blank">m@mihasya.com</a>&gt;
                                  wrote:<br>
                                  &gt; There are two relevant bugs (that
                                  I know of), both resolved in somewhat
                                  recent versions, which make somewhat
                                  regular restarts beneficial<br>
                                  &gt; <a =
href=3D"https://issues.apache.org/jira/browse/CASSANDRA-2868" =
target=3D"_blank">https://issues.apache.org/jira/browse/CASSANDRA-2868</a>=

                                  (memory leak in GCInspector, fixed in
                                  0.7.9/0.8.5)<br>
                                  &gt; <a =
href=3D"https://issues.apache.org/jira/browse/CASSANDRA-2252" =
target=3D"_blank">https://issues.apache.org/jira/browse/CASSANDRA-2252</a>=

                                  (heap fragmentation due to the way
                                  memtables used to be allocated,
                                  refactored in 1.0.0)<br>
                                  &gt; Restarting daily is probably too
                                  frequent for either one of those
                                  problems. We usually notice degraded
                                  performance in our ancient cluster
                                  after ~2 weeks w/o a restart.<br>
                                  &gt; As Aaron mentioned, if you have
                                  plenty of disk space, there's no
                                  reason to worry about "cruft"
                                  sstables. The size of your active set
                                  is what matters, and you can determine
                                  if that's getting too big by watching
                                  for iowait (due to reads from the data
                                  partition) and/or paging activity of
                                  the java process. When you hit that
                                  problem, the solution is to 1. try to
                                  tune your caches and 2. add more nodes
                                  to spread the load. I'll reiterate -
                                  looking at raw disk space usage should
                                  not be your guide for that.<br>
                                  &gt; "Forcing" a gc generally works,
                                  but should not be relied upon (note
                                  "suggest" in&nbsp;<a =
href=3D"http://docs.oracle.com/javase/6/docs/api/java/lang/System.html#gc%=
28%29" =
target=3D"_blank">http://docs.oracle.com/javase/6/docs/api/java/lang/Syste=
m.html#gc()</a>).
                                  It's great news that 1.0 uses a better
                                  mechanism for releasing unused
                                  sstables.<br>
                                  &gt; nodetool compact triggers a
                                  "major" compaction and is no longer a
                                  recommended by datastax (details =
here&nbsp;<a =
href=3D"http://www.datastax.com/docs/1.0/operations/tuning#tuning-compacti=
on" =
target=3D"_blank">http://www.datastax.com/docs/1.0/operations/tuning#tunin=
g-compaction</a>
                                  bottom of the page).<br>
                                  &gt; Hope this helps.<br>
                                  &gt; Mike.<br>
                                  &gt; On Wed, Jan 25, 2012 at 5:14 PM,
                                  aaron morton &lt;<a =
href=3D"mailto:aaron@thelastpickle.com" =
target=3D"_blank">aaron@thelastpickle.com</a>&gt;
                                  wrote:<br>
                                  &gt;<br>
                                  &gt; That disk usage pattern is to be
                                  expected in pre 1.0 versions. Disk
                                  usage is far less interesting than
                                  disk free space, if it's using 60 GB
                                  and there is 200GB thats ok. If it's
                                  using 60Gb and there is 6MB free thats
                                  a problem.<br>
                                  &gt; In pre 1.0 the compacted files
                                  are deleted on disk by waiting for the
                                  JVM do decide to GC all remaining
                                  references. If there is not enough
                                  space (to store the total size of the
                                  files it is about to write or compact)
                                  on disk GC is forced and the files are
                                  deleted. Otherwise they will get
                                  deleted at some point in the =
future.&nbsp;<br>
                                  &gt; In 1.0 files are reference
                                  counted and space is freed much
                                  sooner.&nbsp;<br>
                                  &gt; With regard to regular
                                  maintenance, node tool cleanup remvos
                                  data from a node that it is no longer
                                  a replica for. This is only of use
                                  when you have done a token =
move.&nbsp;<br>
                                  &gt; I would not recommend a daily
                                  restart of the cassandra process. You
                                  will lose all the run time
                                  optimizations the JVM has made (i
                                  think the mapped files pages will stay
                                  resident). As well as adding
                                  additional entropy to the system which
                                  must be repaired via HH, RR or
                                  nodetool repair.&nbsp;<br>
                                  &gt; If you want to see compacted
                                  files purged faster the best approach
                                  would be to upgrade to 1.0.&nbsp;<br>
                                  &gt; Hope that helps.&nbsp;<br>
                                  &gt; -----------------<br>
                                  &gt; Aaron Morton<br>
                                  &gt; Freelance Developer<br>
                                  &gt; @aaronmorton<br>
                                  &gt; <a =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a><br>
                                  &gt; On 26/01/2012, at 9:51 AM, R.
                                  Verlangen wrote:<br>
                                  &gt;<br>
                                  &gt; In his message he explains that
                                  it's for " Forcing a GC&nbsp;". GC =
stands
                                  for garbage collection. For some more
                                  background see:&nbsp; <a =
href=3D"http://en.wikipedia.org/wiki/Garbage_collection_%28computer_scienc=
e%29" =
target=3D"_blank">http://en.wikipedia.org/wiki/Garbage_collection_(compute=
r_science)</a>&nbsp;<br>


                                  &gt; Cheers!<br>
                                  &gt;<br>
                                  &gt; 2012/1/25 &lt;<a =
href=3D"mailto:mike.li@thomsonreuters.com" =
target=3D"_blank">mike.li@thomsonreuters.com</a>&gt;<br>
                                  &gt;<br>
                                  &gt; Karl,<br>
                                  &gt;<br>
                                  &gt; Can you give a little more
                                  details on these 2 lines, what do they
                                  do?<br>
                                  &gt;<br>
                                  &gt; java -jar
                                  cmdline-jmxclient-0.10.3.jar -
                                  localhost:8080<br>
                                  &gt; java.lang:type=3DMemory gc<br>
                                  &gt;<br>
                                  &gt; Thank you,<br>
                                  &gt; Mike<br>
                                  &gt;<br>
                                  &gt; -----Original Message-----<br>
                                  &gt; From: Karl Hiramoto [mailto:<a =
href=3D"mailto:karl@hiramoto.org" =
target=3D"_blank">karl@hiramoto.org</a>]<br>
                                  &gt; Sent: Wednesday, January 25, 2012
                                  12:26 PM<br>
                                  &gt; To: <a =
href=3D"mailto:user@cassandra.apache.org" =
target=3D"_blank">user@cassandra.apache.org</a><br>
                                  &gt; Subject: Re: Restart cassandra
                                  every X days?<br>
                                  &gt;<br>
                                  &gt;<br>
                                  &gt; On 01/25/12 19:18, R. Verlangen
                                  wrote:<br>
                                  &gt;&gt; Ok thank you for your
                                  feedback. I'll add these tasks to our
                                  daily<br>
                                  &gt;&gt; cassandra maintenance
                                  cronjob. Hopefully this will keep
                                  things under<br>
                                  &gt;&gt; controll.<br>
                                  &gt;<br>
                                  &gt; I forgot to mention that we found
                                  that Forcing a GC also cleans up =
some<br>
                                  &gt; space.<br>
                                  &gt;<br>
                                  &gt;<br>
                                  &gt; in a cronjob you can do this =
with<br>
                                  &gt; <a =
href=3D"http://crawler.archive.org/cmdline-jmxclient/" =
target=3D"_blank">http://crawler.archive.org/cmdline-jmxclient/</a><br>
                                  &gt;<br>
                                  &gt;<br>
                                  &gt; my cron
                                </div>
                              </div>
                            </blockquote>
                          </div>
                          <br>
                        </div>
                      </blockquote>
                    </div>
                    <br>
                  </div>
                </div>
              </div>
            </div>
          </div>
        </blockquote>
      </div>
      <br>
    </blockquote>
    <br>
  </div>

=
</blockquote></div><br></div></div></div></div></blockquote></div><br></di=
v>
=
</blockquote></div><br></div></div></div></div></blockquote></div><br></di=
v>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_54F1207E-4BDD-463F-867C-12585E13F335--