From: William Oberman <oberman@civicscience.com>
Date: Tue, 2 Apr 2013 10:43:08 -0400
Subject: Re: how to stop out of control compactions?
To: user@cassandra.apache.org

Edward, you make a good point, and I do think I am getting closer to having to increase my cluster size (I'm around ~300GB/node now).

In my case, I think it was neither. I had one node OOM after working on a large compaction, but it continued to run in a zombie-like state (constantly GC'ing), which I didn't have an alert on. Then I had the bad luck of a "close token" also starting a large compaction. I have RF=3 with some of my R/W patterns at quorum, causing that segment of my cluster to get slow (e.g. a % of my traffic started to slow).
I was running 1.1.2 (I haven't had to poke anything for quite some time, obviously), so I upgraded before moving on (as I saw a lot of bug fixes to compaction issues in the release notes). But the upgrade caused even more nodes to start compactions, which led to my original email... I had a cluster where 80% of my nodes were compacting, and I really needed to boost production traffic and couldn't seem to "tamp cassandra down" temporarily.

Thanks for the advice everyone!

will

On Tue, Apr 2, 2013 at 10:20 AM, Edward Capriolo wrote:

> Settings do not make compactions go away. If your compactions are "out of
> control" it usually means one of these things:
> 1) you have a corrupt table that the compaction never finishes on; the
>    sstable count keeps growing
> 2) you do not have enough hardware to handle your write load
>
>
> On Tue, Apr 2, 2013 at 7:50 AM, William Oberman wrote:
>
>> Thanks Gregg & Aaron. Missed that setting!
>>
>> On Tuesday, April 2, 2013, aaron morton wrote:
>>
>>> Set the min and max compaction thresholds for a given column family
>>>
>>> +1 for setting the max_compaction_threshold (as well as the min) on a
>>> CF when you are getting behind. It can limit the size of the compactions
>>> and give things a chance to complete in a reasonable time.
>>>
>>> Cheers
>>>
>>> -----------------
>>> Aaron Morton
>>> Freelance Cassandra Consultant
>>> New Zealand
>>>
>>> @aaronmorton
>>> http://www.thelastpickle.com
>>>
>>> On 2/04/2013, at 3:42 AM, Gregg Ulrich wrote:
>>>
>>> You may want to set the compaction threshold and not the throughput. If
>>> you set the min threshold to something very large (100000), compactions
>>> will not start until cassandra finds this many files to compact (which
>>> it should not).
>>>
>>> In the past I have used this to stop compactions on a node, and then run
>>> an offline major compaction to get through the compaction, then set the
>>> min threshold back. Not everyone likes major compactions though.
>>>
>>>   setcompactionthreshold <keyspace> <cfname> <minthreshold> <maxthreshold>
>>>   - Set the min and max compaction thresholds for a given column family
>>>
>>> On Mon, Apr 1, 2013 at 12:38 PM, William Oberman <oberman@civicscience.com> wrote:
>>>
>>>> I'll skip the prelude, but I worked myself into a bit of a jam. I'm
>>>> recovering now, but I want to double check that I'm thinking about
>>>> things correctly.
>>>>
>>>> Basically, I was in a state where a majority of my servers wanted to do
>>>> compactions, and rather large ones. This was impacting my site
>>>> performance. I tried nodetool stop COMPACTION. I tried
>>>> setcompactionthroughput=1. I tried restarting servers, but they'd
>>>> restart the compactions pretty much immediately on boot.
>>>>
>>>> Then I realized that
>>>>   nodetool stop COMPACTION
>>>> only stopped running compactions, and then the compactions would
>>>> re-enqueue themselves rather quickly.
>>>>
>>>> So, right now I have:
>>>> 1.) scripts running on N-1 servers looping on "nodetool stop
>>>> COMPACTION" in a tight loop
>>>> 2.) On the "Nth" server, I've disabled gossip/thrift and turned up
>>>> setcompactionthroughput to 999
>>>> 3.) When the Nth server completes, I pick from the remaining N-1 (well,
>>>> I'm still running the first compaction, which is going to take 12 more
>>>> hours, but that is the plan at least).
>>>>
>>>> Does this make sense? Other than the fact that there were probably
>>>> warning signs that would have prevented me from getting into this state
>>>> in the first place?
>>>> :-)
>>>>
>>>> will
>>
>>
>> --
>> Will Oberman
>> Civic Science, Inc.
>> 6101 Penn Avenue, Fifth Floor
>> Pittsburgh, PA 15206
>> (M) 412-480-7835
>> (E) oberman@civicscience.com
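
For readers landing on this thread from the archive, here is roughly how the knobs discussed above map onto nodetool. This is a sketch under assumptions, not a recipe: the host, keyspace, and column family names (<host>, MyKeyspace, MyCF) are placeholders, the numeric values are illustrative, and exact behavior varies by version (the thread concerns the 1.1/1.2 era).

  # Pause new compactions on a CF by raising the min threshold very high
  # (Gregg's approach); restore the defaults (4 and 32 for size-tiered)
  # when you are done.
  nodetool -h <host> setcompactionthreshold MyKeyspace MyCF 100000 100000
  nodetool -h <host> setcompactionthreshold MyKeyspace MyCF 4 32

  # Kill compactions that are already running (they tend to re-enqueue
  # quickly, which is why the thread loops this command).
  nodetool -h <host> stop COMPACTION

  # Throttle compaction I/O on a node, in MB/s (0 disables throttling).
  nodetool -h <host> setcompactionthroughput 1

  # Let one node churn through its backlog without serving client traffic
  # (Will's "Nth server" step): drop it out of the request path, raise the
  # throughput cap, watch progress, then re-enable.
  nodetool -h <host> disablethrift
  nodetool -h <host> disablegossip
  nodetool -h <host> setcompactionthroughput 999
  nodetool -h <host> compactionstats
  nodetool -h <host> enablegossip
  nodetool -h <host> enablethrift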
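
And a minimal sketch of the "tight loop" from step 1, assuming a plain bash wrapper and a made-up sleep interval:

  # Keep killing compactions on the N-1 nodes that must stay responsive;
  # stopped compactions re-enqueue themselves, hence the loop.
  while true; do
    nodetool -h <host> stop COMPACTION
    sleep 5
  done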