From: William Oberman <oberman@civicscience.com>
Date: Tue, 2 Apr 2013 11:13:16 -0400
Subject: Re: how to stop out of control compactions?
To: user@cassandra.apache.org

I just tried to use this setting (I'm using 1.1.9). And it appears I can't set min > 32, as that's the maximum allowed max now (using nodetool at least). Not sure if JMX would allow more access, but I don't like bypassing things I don't fully understand. I think I'll just leave my compaction killers running instead (not that killing compactions constantly isn't messing with things as well...).

will

On Tue, Apr 2, 2013 at 10:43 AM, William Oberman wrote:

> Edward, you make a good point, and I do think I am getting closer to having to increase my cluster size (I'm around ~300GB/node now).
>
> In my case, I think it was neither. I had one node OOM after working on a large compaction, but it continued to run in a zombie-like state (constantly GC'ing), which I didn't have an alert on. Then I had the bad luck of a "close token" also starting a large compaction. I have RF=3 with some of my R/W patterns at quorum, causing that segment of my cluster to get slow (e.g. a % of my traffic started to slow).
> I was running 1.1.2 (I haven't had to poke anything for quite some time, obviously), so I upgraded before moving on (as I saw a lot of bug fixes for compaction issues in the release notes). But the upgrade caused even more nodes to start compactions, which led to my original email... I had a cluster where 80% of my nodes were compacting, and I really needed to boost production traffic and couldn't seem to "tamp Cassandra down" temporarily.
>
> Thanks for the advice everyone!
>
> will
>
> On Tue, Apr 2, 2013 at 10:20 AM, Edward Capriolo wrote:
>
>> Settings do not make compactions go away. If your compactions are "out of control", it usually means one of these things:
>> 1) you have a corrupt table that compaction never finishes on, so the sstable count keeps growing
>> 2) you do not have enough hardware to handle your write load
>>
>> On Tue, Apr 2, 2013 at 7:50 AM, William Oberman wrote:
>>
>>> Thanks Gregg & Aaron. Missed that setting!
>>>
>>> On Tuesday, April 2, 2013, aaron morton wrote:
>>>
>>>> Set the min and max compaction thresholds for a given column family
>>>>
>>>> +1 for setting the max_compaction_threshold (as well as the min) on a CF when you are getting behind. It can limit the size of the compactions and give things a chance to complete in a reasonable time.
>>>>
>>>> Cheers
>>>>
>>>> -----------------
>>>> Aaron Morton
>>>> Freelance Cassandra Consultant
>>>> New Zealand
>>>>
>>>> @aaronmorton
>>>> http://www.thelastpickle.com
>>>>
>>>> On 2/04/2013, at 3:42 AM, Gregg Ulrich wrote:
>>>>
>>>> You may want to set the compaction threshold and not the throughput. If you set the min threshold to something very large (100000), compactions will not start until Cassandra finds that many files to compact (which it should not).
>>>>
>>>> In the past I have used this to stop compactions on a node, then run an offline major compaction to get through the compaction, then set the min threshold back.
>>>> Not everyone likes major compactions though.
>>>>
>>>>   setcompactionthreshold <keyspace> <cfname> <minthreshold> <maxthreshold> - Set the min and max compaction thresholds for a given column family
>>>>
>>>> On Mon, Apr 1, 2013 at 12:38 PM, William Oberman <oberman@civicscience.com> wrote:
>>>>
>>>>> I'll skip the prelude, but I worked myself into a bit of a jam. I'm recovering now, but I want to double-check that I'm thinking about things correctly.
>>>>>
>>>>> Basically, I was in a state where a majority of my servers wanted to do compactions, and rather large ones. This was impacting my site performance. I tried nodetool stop COMPACTION. I tried setcompactionthroughput=1. I tried restarting servers, but they'd restart the compactions pretty much immediately on boot.
>>>>>
>>>>> Then I realized that:
>>>>> nodetool stop COMPACTION
>>>>> only stopped running compactions, and then the compactions would re-enqueue themselves rather quickly.
>>>>>
>>>>> So, right now I have:
>>>>> 1.) scripts running on N-1 servers looping on "nodetool stop COMPACTION" in a tight loop
>>>>> 2.) on the "Nth" server I've disabled gossip/thrift and turned setcompactionthroughput up to 999
>>>>> 3.) when the Nth server completes, I pick from the remaining N-1 (well, I'm still running the first compaction, which is going to take 12 more hours, but that is the plan at least)
>>>>>
>>>>> Does this make sense? Other than the fact there were probably warning signs that would have prevented me from getting into this state in the first place? :-)
>>>>>
>>>>> will
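For reference, the workaround described in this thread can be sketched as a short shell sequence. This is a minimal sketch, not a recommendation: the keyspace/CF names (`myks`/`mycf`) are hypothetical placeholders, the threshold/throughput values come from the thread (min/max of 32 being the 1.1.x ceiling, default throughput 16 MB/s), and `nodetool` is stubbed as a shell function so the sketch runs standalone. On a real node, delete the stub so the real binary is used.

```shell
#!/bin/sh
# Stand-in for the real nodetool binary so this sketch is self-contained.
# Delete this function on a real Cassandra node.
nodetool() { echo "nodetool $*"; }

# 1) On the N-1 nodes to keep quiet: raise the min threshold so new
#    compactions stop being scheduled (capped at 32 via nodetool on 1.1.x,
#    per the thread), and repeatedly kill any compaction already running.
nodetool setcompactionthreshold myks mycf 32 32
for i in 1 2 3; do          # in practice: while true; do ...; sleep 5; done
  nodetool stop COMPACTION
done

# 2) On the single node allowed to compact: take it out of client and
#    cluster traffic, and remove the throttle so it finishes quickly.
nodetool disablethrift
nodetool disablegossip
nodetool setcompactionthroughput 999

# 3) Once it finishes: restore the defaults, rejoin the ring, and move on
#    to the next node in the rotation.
nodetool setcompactionthroughput 16
nodetool setcompactionthreshold myks mycf 4 32
nodetool enablegossip
nodetool enablethrift
```

As the thread notes, `nodetool stop COMPACTION` only cancels in-flight compactions, which is why step 1 needs both the threshold change (to stop re-enqueueing) and the kill loop (to clear what is already running).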