Subject: Re: Too many SSTables after rebalancing cluster (LCS)
From: Nate McCall
To: Cassandra Users <user@cassandra.apache.org>
Date: Wed, 27 Aug 2014 17:02:33 -0500

Try turning down 'tombstone_threshold' to something like '0.05' from its
default of '0.2'. This will cause an SSTable to be considered for
tombstone-only compaction more frequently (once 5% of its columns are
tombstones instead of 20%).

For a bit more info, see:
http://www.datastax.com/documentation/cql/3.0/cql/cql_reference/compactSubprop.html
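On 1.2 that's a single schema change. A minimal sketch, assuming a
hypothetical table mykeyspace.mytable (note that the 'class' entry has to
be restated whenever the compaction map is altered):

    -- mykeyspace.mytable is a placeholder for the affected LCS CF:
    ALTER TABLE mykeyspace.mytable
      WITH compaction = { 'class' : 'LeveledCompactionStrategy',
                          'tombstone_threshold' : 0.05 };
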
On Tue, Aug 26, 2014 at 1:38 PM, Paulo Ricardo Motta Gomes
<paulo.motta@chaordicsystems.com> wrote:

> Hey folks,
>
> After adding more nodes and moving tokens of the "old" nodes to rebalance
> the ring, I noticed that the "old" nodes had significantly more data than
> the newly bootstrapped nodes, even after cleanup.
>
> I noticed that the old nodes had a much larger number of SSTables on LCS
> CFs, most of them located in the last level:
>
> Node N-1 (old node): [1, 10, 102/100, 173, 2403, 0, 0, 0, 0] (total: 2695)
> Node N (new node): [1, 10, 108/100, 214, 0, 0, 0, 0, 0] (total: 339)
> Node N+1 (old node): [1, 10, 87, 113, 1076, 0, 0, 0, 0] (total: 1287)
>
> Since these SSTables have a lot of tombstones, and they're not updated
> frequently, they remain in the last level forever and are never cleaned.
>
> What is the solution here? The good old "change to STCS and then back to
> LCS", or is there something less brute force?
>
> Environment: Cassandra 1.2.16 - non-vnodes
>
> Any help would be very much appreciated.
>
> Cheers,
>
> --
> Paulo Motta
>
> Chaordic | Platform
> www.chaordic.com.br
> +55 48 3232.3200
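For reference, a rough sketch of the "STCS and back" flip mentioned above
(same placeholder table name; the second statement should only be run once
the size-tiered recompaction has finished):

    -- Flip to size-tiered so the old levels get recompacted away:
    ALTER TABLE mykeyspace.mytable
      WITH compaction = { 'class' : 'SizeTieredCompactionStrategy' };

    -- ...wait for pending compactions to drain, then flip back:
    ALTER TABLE mykeyspace.mytable
      WITH compaction = { 'class' : 'LeveledCompactionStrategy' };
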
--
-----------------
Nate McCall
Austin, TX
@zznate

Co-Founder & Sr. Technical Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com