From: Alain RODRIGUEZ <arodrime@gmail.com>
Date: Thu, 28 Mar 2013 10:18:14 +0100
Subject: Re: bloom filter fp ratio of 0.98 with fp_chance of 0.01
To: user@cassandra.apache.org
Reply-To: user@cassandra.apache.org

"remember is used more IO than STS"

Do you mean during compactions? Because I thought that LCS should decrease the number of disk reads (since 90% of the data isn't spread across multiple sstables, C* needs to read only one file to find the entire row) while it is not compacting, right?


2013/3/28 aaron morton <aaron@thelastpickle.com>
You nailed it. A significant number of reads are done from hundreds of sstables ( I have to add, compaction is apparently constantly 6000-7000 tasks behind and the vast majority of the reads access recently written data )

So that's not good.
If IO is saturated then maybe LCS is not for you, remember it uses more IO than STS.

Otherwise look at the compaction yaml settings to see if you can make it go faster but watch out that you don't hurt normal requests.
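
The knobs I mean, roughly (the values here are illustrative, check the defaults in your own cassandra.yaml):

    # cassandra.yaml
    compaction_throughput_mb_per_sec: 16   # IO throttle on compaction; raise it, or set 0 to disable throttling
    concurrent_compactors: 2               # how many compactions may run in parallel

The throttle can also be changed at runtime while you watch the disks:

    nodetool setcompactionthroughput 0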

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 28/03/2013, at 7:00 AM, Wei Zhu <wz1975@yahoo.com> wrote:

Welcome to the wonderland of SSTableSize of LCS. There is some discussion around it, but no guidelines yet.

I asked the people in IRC; someone is running as high as 128M in production with no problem. I guess you have to test it on your system and see how it performs.
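
If you want to experiment, the size can be changed on a live CF from cassandra-cli, something along these lines (only sstables written after the change get the new size, so the on-disk layout converges gradually):

    [default@KS] update column family CF with compaction_strategy_options = {sstable_size_in_mb: 128};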

Attached is the related thread for your reference.

-Wei

----- Original Message -----
From: "Andras Szerdahelyi" <andras.szerdahelyi@ignitionone.com>
To: user@cassandra.apache.org
Sent: Wednesday, March 27, 2013 1:19:06 AM
Subject: Re: bloom filter fp ratio of 0.98 with fp_chance of 0.01


Aaron,




What version are you using ?


1.1.9

Have you changed the bf_chance ? The sstables need to be rebuilt for it to take effect.


I did ( several times ) and I ran upgradesstables after
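
i.e. each time something along the lines of (cassandra-cli plus nodetool, KS/CF names as in the stats below):

    [default@KS] update column family CF with bloom_filter_fp_chance = 0.01;
    $ nodetool upgradesstables KS CF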





Not sure what this means.
Are you saying it's in a boat on a river, with tangerine trees and marmalade skies ?

You nailed it. A significant number of reads are done from hundreds of sstables ( I have to add, compaction is apparently constantly 6000-7000 tasks behind and the vast majority of the reads access recently written data )
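
For the record, that backlog is what nodetool reports; output abbreviated, the count illustrative, it hovers between 6000 and 7000:

    $ nodetool compactionstats
    pending tasks: 6841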




Take a look at the nodetool cfhistograms to get a better idea of the row size and use that info when considering the sstable size.


It's around 1-20K, what should I optimise the LCS sstable size for? I suppose "I want to fit as many complete rows as possible into a single sstable, to keep file count down while avoiding compactions of oversized ( double digit gigabytes? ) sstables at higher levels"?
Do I have to run a major compaction after a change to sstable_size_in_mb ? The larger sstable size wouldn't really affect sstables on levels above L0, would it?






Thanks!!
Andras






From: aaron morton <aaron@thelastpickle.com>
Reply-To: user@cassandra.apache.org
Date: Tuesday 26 March 2013 21:46
To: user@cassandra.apache.org
Subject: Re: bloom filter fp ratio of 0.98 with fp_chance of 0.01

What version are you using ?
1.2.0 allowed a null bf chance, and I think it returned .1 for LCS and .01 for STS compaction.
Have you changed the bf_chance ? The sstables need to be rebuilt for it to take effect.





"and sstables read is in the skies"
Not sure what this means.
Are you saying it's in a boat on a river, with tangerine trees and marmalade skies ?





SSTable count: 22682

Lots of files there, I imagine this would dilute the effectiveness of the key cache. It's caching (sstable, key) tuples.
You may want to look at increasing the sstable_size with LCS.
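
To see what the key cache is actually doing, check the Key Cache line of nodetool info (size, capacity, hits, requests, recent hit rate); its capacity comes from key_cache_size_in_mb in cassandra.yaml, where empty means the default of min(5% of heap, 100MB):

    $ nodetool info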





Compacted row minimum size: 104
Compacted row maximum size: 263210


Compacted row mean size: 3041
Take a look at the nodetool cfhistograms to get a better idea of the row size and use that info when considering the sstable size.
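
e.g., with the keyspace / CF from below:

    $ nodetool cfhistograms KS CF

The Row Size column is a histogram in bytes, and the SSTables column shows how many sstables recent reads actually touched.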


Cheers








-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com


On 26/03/2013, at 6:16 AM, Andras Szerdahelyi <andras.szerdahelyi@ignitionone.com> wrote:




Hello list,

Could anyone shed some light on how an FP chance of 0.01 can coexist with a measured FP ratio of .. 0.98 ? Am I reading this wrong, or do 98% of the requests hitting the bloom filter produce a false positive while the "target" false ratio is 0.01?
( Also, key cache hit ratio is around 0.001 and sstables read is in the skies ( non-exponential (non-)drop-off for LCS ) but that should be filed under "effect" and not "cause"? )



[default@unknown] use KS;
Authenticated to keyspace: KS
[default@KS] describe CF;
ColumnFamily: CF
Key Validation Class: org.apache.cassandra.db.marshal.BytesType
Default column value validator: org.apache.cassandra.db.marshal.BytesType
Columns sorted by: org.apache.cassandra.db.marshal.BytesType
GC grace seconds: 691200
Compaction min/max thresholds: 4/32
Read repair chance: 0.1
DC Local Read repair chance: 0.0
Replicate on write: true
Caching: ALL
Bloom Filter FP chance: 0.01
Built indexes: []
Compaction Strategy: org.apache.cassandra.db.compaction.LeveledCompactionStrategy
Compaction Strategy Options:
sstable_size_in_mb: 5
Compression Options:
sstable_compression: org.apache.cassandra.io.compress.SnappyCompressor


Keyspace: KS
Read Count: 628950
Read Latency: 93.19921121869784 ms.
Write Count: 1219021
Write Latency: 0.14352380885973254 ms.
Pending Tasks: 0
Column Family: CF
SSTable count: 22682
Space used (live): 119771434915
Space used (total): 119771434915
Number of Keys (estimate): 203837952
Memtable Columns Count: 13125
Memtable Data Size: 33212827
Memtable Switch Count: 15
Read Count: 629009
Read Latency: 88.434 ms.
Write Count: 1219038
Write Latency: 0.095 ms.
Pending Tasks: 0
Bloom Filter False Positives: 37939419
Bloom Filter False Ratio: 0.97928
Bloom Filter Space Used: 261572784
Compacted row minimum size: 104
Compacted row maximum size: 263210
Compacted row mean size: 3041
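
For scale, if that ratio is computed as false positives / ( false positives + true positives ): 37939419 false positives over ~629009 reads is ~60 per read, which is about what a 1% per-sstable fp_chance alone would produce if each read consults the bloom filters of a few thousand of the 22682 sstables - so each filter could be hitting its 0.01 target while the aggregate ratio still reads 0.98?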

I upgraded sstables after changing the FP chance


Thanks!
Andras
<attachment.eml>

