Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of arodrime@gmail.com designates
 209.85.215.49 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <1E807EB9EA831F4184920D0403D4CD6D014F6D24@exchange2010.dimcap.corp>
References: 
 <1E807EB9EA831F4184920D0403D4CD6D014EE1C7@exchange2010.dimcap.corp>
 <CA+VSrLpoCXzi=83aTDrMpMBsRQhdRKQVXQ5KaEhWwUGzNpmByQ@mail.gmail.com>
 <CAORswtxnUKmCW38O85gsq=oc2gx5oFn-NT6P3ZZDVjrSVfCnig@mail.gmail.com>
 <1E807EB9EA831F4184920D0403D4CD6D014F4A3A@exchange2010.dimcap.corp>
 <045D8FD556C73347A47F956EE65F8220185B4DA4@S11MAILD013N2.sh11.lan>
 <CA+VSrLqmJ2uLKsC4vR-5w_sxNb51G0Sb+4n98BUBMkvdTZzvjQ@mail.gmail.com>
 <1E807EB9EA831F4184920D0403D4CD6D014F6D24@exchange2010.dimcap.corp>
From: Alain RODRIGUEZ <arodrime@gmail.com>
Date: Tue, 3 Feb 2015 21:48:10 +0100
Message-ID: 
 <CA+VSrLoPj6jsrJyr4ib0J9Bb2iNPJJKp=X=F1YrfPt_JVqp4Tw@mail.gmail.com>
Subject: Re: Tombstone gc after gc grace seconds
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=089e0158b79c8cf0f6050e353196

--089e0158b79c8cf0f6050e353196
Content-Type: text/plain; charset=ISO-8859-1

Hi, thanks for sharing your tests!

How did you insert the data, though? Did you try adding columns one at a
time and in random order, with a small memtable size, so that each row ends
up heavily fragmented across SSTables (which is normal in a time series use
case)?

I think running the ./md_test configuration against such a data set would be
interesting, taking care to fragment enough that parts of each key end up in
every SSTable, or at least in many of them.
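
A rough sketch of how such a fragmented write order could be generated. The
function name, the batch count and the idea of flushing after every batch are
assumptions made for illustration; they are not taken from the test quoted
below, and the actual INSERT and flush calls are left as placeholders.

```python
import random
from typing import Iterator, List, Tuple


def fragmented_write_order(
    row_keys: int, columns_per_row: int, batches: int, seed: int = 0
) -> Iterator[List[Tuple[int, int]]]:
    """Yield up to `batches` lists of (row_key, column) pairs in random order.

    Writing one batch and then flushing would put that batch into its own
    SSTable, so most row keys end up spread over many SSTables.
    """
    rng = random.Random(seed)
    cells = [(k, c) for k in range(row_keys) for c in range(columns_per_row)]
    if not cells or batches < 1:
        return
    rng.shuffle(cells)
    size = -(-len(cells) // batches)  # ceiling division
    for start in range(0, len(cells), size):
        yield cells[start:start + size]


for batch in fragmented_write_order(row_keys=256, columns_per_row=100, batches=10):
    for row_key, column in batch:
        pass  # hypothetical: issue one INSERT for (row_key, column) here
    # hypothetical: flush here so that this batch becomes its own SSTable
```

A small memtable size would have a similar effect to the explicit flush; the
flush just makes the number of SSTables predictable.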

This would measure how effective tombstone_threshold is in a "normal" time
series workflow (which is a standard use case of C*).
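
To make the concern concrete, here is a toy model, not Cassandra's actual
logic. It assumes a deliberately simplified rule: a tombstone can only be
dropped by compacting one SSTable alone if no other SSTable holds a fragment
of the same key. The function name and the set-based representation of an
SSTable are invented for this sketch.

```python
from typing import List, Set


def purgeable_keys(sstables: List[Set[str]], target: int) -> Set[str]:
    """Keys in sstables[target] that appear in no other SSTable.

    Toy rule only: real Cassandra also compares timestamps and
    gc_grace_seconds before dropping a tombstone.
    """
    others: Set[str] = set()
    for index, keys in enumerate(sstables):
        if index != target:
            others |= keys
    return sstables[target] - others


# "Write once per partition key": each key lives in exactly one SSTable.
write_once = [{"a", "b"}, {"c", "d"}]
print(sorted(purgeable_keys(write_once, 0)))   # ['a', 'b']

# Time series: every key has fragments in every SSTable.
fragmented = [{"a", "b"}, {"a", "b"}]
print(sorted(purgeable_keys(fragmented, 0)))   # []
```

Under this simplified rule, the single-SSTable compaction has nothing it can
drop in the fragmented case, which is the situation the test above should
exercise.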

By the way, this could be run with both LCS
<http://www.datastax.com/dev/blog/leveled-compaction-in-apache-cassandra>
and STCS
<http://www.datastax.com/documentation/cql/3.1/cql/cql_reference/tabProp.html?scroll=tabProp__moreCompaction>
to see how each behaves.
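
A minimal sketch of how the two variants could be set up with the same
tombstone settings. The helper name and the keyspace "ks" are hypothetical;
the threshold and gc_grace_seconds values are the ones from the md_test case
quoted below. The helper only builds the CQL text and does not talk to a
cluster.

```python
def alter_compaction_cql(
    table: str, strategy: str, tombstone_threshold: str, gc_grace_seconds: int
) -> str:
    """Build an ALTER TABLE statement that sets the compaction options."""
    options = {"class": strategy, "tombstone_threshold": tombstone_threshold}
    rendered = ", ".join(f"'{key}': '{value}'" for key, value in options.items())
    return (
        f"ALTER TABLE {table} WITH compaction = {{{rendered}}} "
        f"AND gc_grace_seconds = {gc_grace_seconds};"
    )


# "ks" is a placeholder keyspace name.
for strategy in ("SizeTieredCompactionStrategy", "LeveledCompactionStrategy"):
    print(alter_compaction_cql("ks.md_test", strategy, "0.000001", 5))
```

The printed statements could then be pasted into cqlsh, one table per
strategy, before loading the same fragmented data into each.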

Thanks again, it is interesting to hear about tests like yours!

C*heers

Alain

2015-01-30 17:32 GMT+01:00 Ravi Agrawal <ragrawal@clearpoolgroup.com>:

> I did a small test. I wrote 30MB of data to 4 different column families.
>
> 256 row keys and 100K columns on average.
>
> And then deleted all data from all of them.
>
>
>
> 1.       Md_normal - created using default compaction parameters, with
> gc_grace_seconds set to 5 seconds. Data was written and then deleted.
> Compaction was run using "nodetool compact keyspace columnfamily". I still
> see the full data size on disk. I cannot query columns (consistent with
> the data having been deleted), and querying rows in cqlsh hits a timeout.
>
> 2.       Md_test - created using the following compaction parameters -
> "compaction={'tombstone_threshold': '0.000001', 'class':
> 'SizeTieredCompactionStrategy'}", with gc_grace_seconds set to 5 seconds.
> Disk size is reduced, and I am able to query rows (the query returns 0).
>
> 3.       Md_test2 - created using the following compaction parameters -
> "compaction={'tombstone_threshold': '0.0', 'class':
> 'SizeTieredCompactionStrategy'}". Disk size is reduced, but I am not able
> to query rows using cqlsh; the query hits a timeout.
>
> 4.       Md_forcecompact - created using the compaction parameters
> "compaction={'unchecked_tombstone_compaction': 'true', 'class':
> 'SizeTieredCompactionStrategy'}", with gc_grace_seconds set to 5 seconds.
> Data was written and then deleted. I still see the full data size on disk.
> I cannot query any data using mddbreader, and querying rows in cqlsh hits
> a timeout.
>
>
>
> The next day, the sizes on disk were:
>
> 30M     ./md_forcecompact
> 4.0K    ./md_test
> 304K    ./md_test2
> 30M     ./md_normal
>
>
>
> To give a feel for the data that we have:
>
> 8000 row keys per day, with columns added throughout the day; 300K columns
> per row key on average.
>
>
>
>
>
>
>
> *From:* Alain RODRIGUEZ [mailto:arodrime@gmail.com]
> *Sent:* Friday, January 30, 2015 4:26 AM
>
> *To:* user@cassandra.apache.org
> *Subject:* Re: Tombstone gc after gc grace seconds
>
>
>
> The point is that all the "parts" or "fragments" of the row need to be in
> the SSTables involved in the compaction for C* to be able to evict the row
> effectively.
>
>
>
> My understanding of those parameters is that they will trigger a
> compaction on any SSTable that exceeds this ratio. This will work properly
> if you never "update" a row (by modifying a value or adding a column). If
> your workflow is something like "Write once per partition key", this
> parameter will do the job.
>
>
>
> If you have fragments, you might trigger this compaction for nothing. In
> the case of frequently updated rows (like when using wide rows / time
> series), your only way to get rid of tombstones is a major compaction.
>
>
>
> That's how I understand this.
>
>
>
> Hope this helps,
>
>
>
> C*heers,
>
>
>
> Alain
>
>
>
> 2015-01-30 1:29 GMT+01:00 Mohammed Guller <mohammed@glassbeam.com>:
>
>  Ravi -
>
>
>
> It may help.
>
>
>
> What version are you running? Do you know if minor compaction is getting
> triggered at all? One way to check would be to see how many SSTables the
> data directory has.
>
>
>
> Mohammed
>
>
>
> *From:* Ravi Agrawal [mailto:ragrawal@clearpoolgroup.com]
> *Sent:* Thursday, January 29, 2015 1:29 PM
> *To:* user@cassandra.apache.org
> *Subject:* RE: Tombstone gc after gc grace seconds
>
>
>
> Hi,
>
> I saw there are 2 more interesting parameters -
>
> a.       tombstone_threshold - A ratio of garbage-collectable tombstones
> to all contained columns, which if exceeded by the SSTable triggers
> compaction (with no other SSTables) for the purpose of purging the
> tombstones. Default value - 0.2
>
> b.      unchecked_tombstone_compaction - True enables more aggressive
> than normal tombstone compactions. A single SSTable tombstone compaction
> runs without checking the likelihood of success. Cassandra 2.0.9 and later.
>
> Could I use these to get what I want?
>
> The problem I am encountering is that, even long after gc_grace_seconds, I
> see no reduction in disk space until I run compaction manually. I was
> thinking of setting tombstone_threshold close to 0 and
> unchecked_tombstone_compaction to true.
>
> Also, we are not running nodetool repair on a weekly basis as of now.
>
>
>
> *From:* Eric Stevens [mailto:mightye@gmail.com]
> *Sent:* Monday, January 26, 2015 12:11 PM
> *To:* user@cassandra.apache.org
> *Subject:* Re: Tombstone gc after gc grace seconds
>
>
>
> My understanding is consistent with Alain's: there's no way to force a
> tombstone-only compaction; your only option is a major compaction.  If
> you're using size tiered, that comes with its own drawbacks.
>
>
>
> I wonder if there's a technical limitation that prevents introducing a
> shadowed data cleanup style operation (overwritten data, including deletes,
> plus tombstones past their gc grace period); or maybe even coupling it
> directly with cleanup, since most of the work (rewriting old SSTables) would
> be identical.  I can't think of anything off the top of my head, but it
> would be so useful that it seems like there's got to be something I'm
> missing.
>
>
>
> On Mon, Jan 26, 2015 at 4:15 AM, Alain RODRIGUEZ <arodrime@gmail.com>
> wrote:
>
> I don't think that such a thing exists, as SSTables are immutable. You
> compact an SSTable entirely or not at all. Minor compaction will eventually
> evict tombstones. If it is too slow, AFAIK, the "better" solution is a major
> compaction.
>
>
>
> C*heers,
>
>
>
> Alain
>
>
>
> 2015-01-23 0:00 GMT+01:00 Ravi Agrawal <ragrawal@clearpoolgroup.com>:
>
>  Hi,
>
> I want to trigger just a tombstone compaction once gc_grace_seconds has
> elapsed, rather than running nodetool compact keyspace columnfamily.
>
> Is there any way I can do that?
>
>
>
> Thanks
>
>
>

--089e0158b79c8cf0f6050e353196--