From: Redmumba <redmumba@gmail.com>
Date: Wed, 4 Jun 2014 10:44:56 -0700
To: Russell Bradberry <rbradberry@gmail.com>
Cc: user@cassandra.apache.org
Subject: Re: Customized Compaction Strategy: Dev Questions

Sorry, yes, that is what I was looking to do--i.e., create a "TopologicalCompactionStrategy" or similar.


On Wed, Jun 4, 2014 at 10:40 AM, Russell Bradberry <rbradberry@gmail.com> wrote:
Maybe I'm misunderstanding something, but what makes you think that running a major compaction every day will cause the data from January 1st to exist in only one SSTable, and not have data from other days in the SSTable as well? Are you talking about making a new compaction strategy that creates SSTables by day?



On June 4, 2014 at 1:36:10 PM, Redmumba (redmumba@gmail.com) wrote:

Let's say I run a major compaction every day, so that the "oldest" sstable contains only the data for January 1st. Assuming all the nodes are in sync and have had at least one repair run before the table is dropped (so that all information for that time period is "the same"), wouldn't it be safe to assume that the same data would be dropped on all nodes? There might be a window while the compaction is running where different nodes have an inconsistent view of just that day's data (some would have it and others would not), but the cluster would still function and become eventually consistent, correct?

Also, if the entirety of the sstable is being dropped, wouldn't the tombstones be removed with it? I wouldn't be concerned with individual rows and columns, and this is a write-only table, more or less--the only deletes that occur in the current system are to delete the old data.


On Wed, Jun 4, 2014 at 10:24 AM, Russell Bradberry <rbradberry@gmail.com> wrote:
I'm not sure what you want to do is feasible. At a high level I can see you running into issues with RF, etc. The SSTables are not identical from node to node, so if you drop a full SSTable on one node there is no single corresponding SSTable on the adjacent nodes to drop. You would need to choose data to compact out and ensure it is removed on all replicas as well. But if your problem is that you're low on disk space, then you probably won't be able to write out a new SSTable with the older information compacted out. Also, there is more to an SSTable than just data; it could have tombstones and other relics that haven't been cleaned up from nodes coming or going.




On June 4, 2014 at 1:10:58 PM, Redmumba (redmumba@gmail.com) wrote:

Thanks, Russell--yes, a similar concept, just applied to sstables. I'm assuming this would require changes to both major compaction and probably GC (to remove the old tables), but since I'm not super-familiar with the C* internals, I wanted to make sure it was feasible with the current toolset before I actually dived in and started tinkering.

Andrew


On Wed, Jun 4, 2014 at 10:04 AM, Russell Bradberry <rbradberry@gmail.com> wrote:
Hmm, I see. So something similar to Capped Collections in MongoDB.



On June 4, 2014 at 1:03:46 PM, Redmumba (redmumba@gmail.com) wrote:

Not quite; if I'm at, say, 90% disk usage, I'd like to drop the oldest sstable rather than simply run out of space.

The problem with using TTLs is that I have to try to guess how much data is being put in--since this is auditing data, the usage can vary wildly depending on time of year, verbosity of auditing, etc. I'd like to maximize the disk space--not optimize the cleanup process.

Andrew
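
To make the FIFO idea above concrete, here is a minimal, self-contained sketch of just the selection step, assuming the strategy can see each sstable's on-disk size and newest data timestamp. The AuditSSTable class and tablesToDrop method are hypothetical stand-ins, not part of Cassandra's API; a real strategy would read these values from sstable metadata.

import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical stand-in for per-sstable bookkeeping; a real strategy would
// read the on-disk size and newest data timestamp from sstable metadata.
class AuditSSTable {
    final String name;
    final long sizeBytes;
    final long maxTimestampMillis; // newest cell timestamp in this sstable

    AuditSSTable(String name, long sizeBytes, long maxTimestampMillis) {
        this.name = name;
        this.sizeBytes = sizeBytes;
        this.maxTimestampMillis = maxTimestampMillis;
    }
}

public class FifoDropSketch {
    // Walk the sstables oldest-first and mark whole tables for removal until
    // projected usage falls back under the threshold (e.g. 0.90 of capacity).
    static List<AuditSSTable> tablesToDrop(Deque<AuditSSTable> oldestFirst,
                                           long usedBytes, long capacityBytes,
                                           double threshold) {
        List<AuditSSTable> victims = new ArrayList<>();
        long projected = usedBytes;
        while (!oldestFirst.isEmpty() && projected > threshold * capacityBytes) {
            AuditSSTable oldest = oldestFirst.pollFirst();
            victims.add(oldest);
            projected -= oldest.sizeBytes;
        }
        return victims;
    }

    public static void main(String[] args) {
        Deque<AuditSSTable> byAge = new ArrayDeque<>();
        byAge.add(new AuditSSTable("audit-2014-01-01", 40L << 30, 1388620799000L));
        byAge.add(new AuditSSTable("audit-2014-01-02", 35L << 30, 1388707199000L));
        // 950 GiB used of a 1 TiB disk with a 90% threshold: dropping only the
        // oldest table is enough to get back under 921.6 GiB.
        for (AuditSSTable t : tablesToDrop(byAge, 950L << 30, 1L << 40, 0.90))
            System.out.println("would drop " + t.name);
    }
}

Everything here happens per node; as noted above, the same sstable boundaries won't exist on the replicas, so each node would make this decision independently.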


On Wed, Jun 4, 2014 at 9:47 AM, Russell Bradberry <rbradberry@gmail.com> wrote:
You mean this:

https://issues.apache.org/jira/browse/CASSANDRA-5228

?



On June 4, 2014 at 12:42:33 PM, Redmumba (redmumba@gmail.com) wrote:

Good morning!

I've asked (and seen other people ask) about the ability to drop old sstables, basically creating a FIFO-like clean-up process. Since we're using Cassandra as an auditing system, this is particularly appealing to us because it means we can maximize the amount of auditing data we can keep while still allowing Cassandra to clear old data automatically.

My idea is this: perform compaction based on the range of dates available in the sstable (or just metadata about when it was created). For example, a major compaction could create one combined sstable per day--so that, say, 60 days of data would end up in 60 sstables after a major compaction.

My question then is, will this be possible by simply implementing a separate AbstractCompactionStrategy? Does this sound feasible at all? Based on the implementations of the SizeTiered and Leveled strategies, it looks like I would have the ability to control what and how things get compacted, but I wanted to verify before putting time into it.

Thank you so much for your time!

Andrew
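
For what it's worth, a rough, self-contained sketch of the per-day bucketing step is below. It is illustrative only: the method and parameter names are made up, and a real implementation would plug into AbstractCompactionStrategy and pull the newest data timestamp for each sstable from its metadata rather than taking a map of names to timestamps.

import java.time.Instant;
import java.time.LocalDate;
import java.time.ZoneOffset;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class DayBucketSketch {
    // Group sstables (represented here as names mapped to the newest data
    // timestamp they contain, in millis) by the UTC calendar day of that
    // timestamp. Each bucket would become one compaction task, so a full pass
    // over 60 days of data yields roughly one combined sstable per day.
    static Map<LocalDate, List<String>> bucketByDay(Map<String, Long> newestTimestampByTable) {
        Map<LocalDate, List<String>> buckets = new TreeMap<>(); // oldest day first
        for (Map.Entry<String, Long> e : newestTimestampByTable.entrySet()) {
            LocalDate day = Instant.ofEpochMilli(e.getValue())
                                   .atZone(ZoneOffset.UTC)
                                   .toLocalDate();
            buckets.computeIfAbsent(day, d -> new ArrayList<>()).add(e.getKey());
        }
        return buckets;
    }

    public static void main(String[] args) {
        Map<String, Long> input = new TreeMap<>();
        input.put("audit-17", 1401753600000L); // 2014-06-03
        input.put("audit-18", 1401790000000L); // also 2014-06-03
        input.put("audit-19", 1401840000000L); // 2014-06-04
        // Prints {2014-06-03=[audit-17, audit-18], 2014-06-04=[audit-19]}
        System.out.println(bucketByDay(input));
    }
}

The drop-the-oldest logic from the earlier sketch would then operate on whole day buckets rather than on individual sstables.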



