Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of ngrigoriev@gmail.com designates
 209.85.192.180 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAL=4nrvunpSm1igcY7uu+EOmcgc8Y6kr2=Hc0AYnPPz0wowU0A@mail.gmail.com>
References: 
 <CAFMDvQtOfZ=ekEuOshz7MNQR0UPmijfyvPg9U36uyz_4Ypizpw@mail.gmail.com>
	<CAEp=YLj=QPnWCdiJcaqDwOvtu6XEAR3oJrXVSmTH4eXMvBFdcg@mail.gmail.com>
	<!&!AAAAAAAAAAAYAAAAAAAAAKgtAblvlKlMj1v1Qm7fLr4igQAAEAAAAIexAc/VgnBAklwhyccnvT8BAAAAAA==@gmail.com>
	<CAL=4nrvunpSm1igcY7uu+EOmcgc8Y6kr2=Hc0AYnPPz0wowU0A@mail.gmail.com>
Date: Sun, 23 Nov 2014 22:37:41 -0500
Message-ID: 
 <CAEp=YLhRYhJpqdWc9Hd1dngB3Vqzff3CEGWd0cu36dVyXAMJ0A@mail.gmail.com>
Subject: Re: Compaction Strategy guidance
From: Nikolai Grigoriev <ngrigoriev@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a113329f05203a30508928471

--001a113329f05203a30508928471
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Just to clarify: when I talked about a large amount of data, I really
meant a large amount of data per node in a single CF (table). LCS does not
seem to cope well once it has thousands of sstables (which makes 4-5
levels).
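
For a rough feel of how the sstable count maps to levels, here is a small
sketch. It assumes the usual LCS layout where L1 holds up to 10 sstables
and each level above holds 10x more; the function name and the sample
counts are made up for illustration and are not taken from my cluster.

```python
def lcs_levels_needed(sstable_count, fanout=10):
    """Return how many levels (L1 and up) are needed to hold sstable_count
    sstables, assuming level N holds at most fanout**N sstables."""
    levels = 0
    capacity = 0
    while capacity < sstable_count:
        levels += 1
        capacity += fanout ** levels
    return levels


# Illustrative counts only; L0 is not included in the result.
for count in (1_000, 5_000, 20_000):
    print(count, "sstables ->", lcs_levels_needed(count), "levels above L0")
```

Under these assumptions a few thousand sstables fill L1 to L4, and counting
L0 as well you get to the 4-5 levels I mentioned.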

When bootstrapping a new node you had better enable the option from
CASSANDRA-6621 (the one that disables STCS in L0). But it will still be a
mess. I have a node that I bootstrapped ~2 weeks ago. Initially it had
7.5K pending compactions; now it has almost stabilized at 4.6K and does
not go down. The number of sstables at L0 is over 11K and it is building
the upper levels very slowly. The total number of sstables is 4x the
normal amount. I am not entirely sure this node will ever get back to
normal life. And believe me, this is not because of I/O: I have SSDs
everywhere and 16 physical cores, and this machine barely uses 1-3 cores
most of the time.

The problem is that allowing the STCS fallback is not a good option
either. It will quickly result in a few 200 GB+ sstables in my
configuration, and those sstables will then never be compacted. Plus, it
will require close to 2x disk space on EVERY disk in my JBOD
configuration, which will kill the node sooner or later. This is all
because all sstables end up at L0 after bootstrap and the process then
moves them to the other levels very slowly. If you have write traffic to
that CF, the number of sstables at L0 will grow quickly, as is happening
in my case now.
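
To put the disk space concern in numbers, here is a back-of-the-envelope
sketch. It assumes the worst case where a compaction merges nothing away,
so its output is as large as the sum of its inputs. The sizes are
assumptions for illustration: four 200 GB sstables for an STCS compaction,
and for LCS one sstable plus about 10 overlapping ones at 160 MB each. The
function name is hypothetical.

```python
def worst_case_output_gb(input_sizes_gb):
    """Upper bound on compaction output size: no rows are merged or purged,
    so the output is as large as all inputs together."""
    return sum(input_sizes_gb)


# Assumed sizes, for illustration only.
stcs_inputs = [200.0] * 4   # four similarly sized 200 GB sstables
lcs_inputs = [0.16] * 11    # one sstable plus ~10 overlapping ones, 160 MB each

print("STCS scratch space: %.1f GB" % worst_case_output_gb(stcs_inputs))
print("LCS scratch space:  %.2f GB" % worst_case_output_gb(lcs_inputs))
```

The exact figures do not matter much; the point is that the temporary
space STCS needs grows with the size of the biggest sstables, while for
LCS it stays tied to the fixed sstable size.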

Once something like https://issues.apache.org/jira/browse/CASSANDRA-8301 is
implemented it may be better.


On Sun, Nov 23, 2014 at 4:53 AM, Andrei Ivanov <aivanov@iponweb.net> wrote:

> Stephane,
>
> We have a somewhat similar C* load profile, hence some comments
> in addition to Nikolai's answer.
> 1. Fallback to STCS: you can actually disable it.
> 2. Based on our experience, if you have a lot of data per node, LCS
> may work just fine. That is, until the moment you decide to join
> another node: chances are that the newly added node will not be able
> to compact what it gets from the old nodes. In your case, if you switch
> strategy, the same thing may happen. This is all due to the limitations
> mentioned by Nikolai.
>
> Andrei,
>
>
> On Sun, Nov 23, 2014 at 8:51 AM, Servando Muñoz G. <smgesi@gmail.com>
> wrote:
> > ABUSE
> >
> >
> >
> > I DO NOT WANT ANY MORE MAILS, I AM FROM MEXICO
> >
> >
> >
> > From: Nikolai Grigoriev [mailto:ngrigoriev@gmail.com]
> > Sent: Saturday, November 22, 2014 07:13 PM
> > To: user@cassandra.apache.org
> > Subject: Re: Compaction Strategy guidance
> > Importance: High
> >
> >
> >
> > Stephane,
> >
> > Like everything good, LCS comes at a certain price.
> >
> > LCS will put the most load on your I/O system (if you use spindles, you
> > may need to be careful about that) and on CPU. Also, LCS (by default)
> > may fall back to STCS if it is falling behind (which is very possible
> > with heavy write activity), and this will result in higher disk space
> > usage. LCS also has a certain limitation that I have discovered lately:
> > sometimes it may not be able to use all of your node's resources
> > (algorithm limitations), and this reduces the overall compaction
> > throughput. This may happen if you have a large column family with lots
> > of data per node. STCS won't have this limitation.
> >
> >
> >
> > By the way, the primary goal of LCS is to reduce the number of sstables
> > C* has to look at to find your data. With LCS functioning properly,
> > this number will most likely be between 1 and 3 for most reads. But if
> > you do few reads and are not concerned about latency today, LCS will
> > most likely only save you some disk space.
> >
> >
> >
> > On Sat, Nov 22, 2014 at 6:25 PM, Stephane Legay <slegay@looplogic.com>
> > wrote:
> >
> > Hi there,
> >
> >
> >
> > use case:
> >
> >
> >
> > - Heavy write app, few reads.
> >
> > - Lots of updates of rows / columns.
> >
> > - Current performance is fine, for both writes and reads.
> >
> > - Currently using SizeTieredCompactionStrategy
> >
> >
> >
> > We're trying to limit the amount of storage used during compaction.
> > Should we switch to LeveledCompactionStrategy?
> >
> >
> >
> > Thanks
> >
> >
> >
> >
> > --
> >
> > Nikolai Grigoriev
> > (514) 772-5178
>


-- 
Nikolai Grigoriev
(514) 772-5178

--001a113329f05203a30508928471--