From: cem <cayiroglu@gmail.com>
To: user@cassandra.apache.org
Date: Tue, 28 May 2013 20:46:18 +0200
Subject: data clean up problem

Hi Experts,

We have a general problem with cleaning up data from the disk. I need to free the disk space after the retention period, and the customer wants to dimension the disk space based on that.

After running multiple performance tests with a TTL of 1 day, we saw that compaction couldn't keep up with the request rate. Disks were getting full after 3 days, and there were also a lot of sstables older than 1 day.

Things that we tried:

- Change the compaction strategy to leveled. (helped a bit, but not much)

- Use a big sstable size (10 GB) with leveled compaction to get more aggressive compaction. (helped a bit, but not much)

- Upgrade Cassandra from 1.0 to 1.2 to use TTL histograms. (didn't help at all, since its key-overlap estimation algorithm generates a 100% match, although we don't have…)

Our column family structure is like this:

Event_data_cf: (we store event data; event_id is randomly generated, and each event has attributes like location=london)

row          data
event id     data blob

timeseries_cf: (key is the attribute that we want to index, e.g. location=london; we didn't use secondary indexes because the indexes are dynamic)

row          data
index key    time series of event ids (event1_id, event2_id, …)

timeseries_inv_cf: (this is used for removing an event by its row key)

row          data
event id     set of index keys

Candidate Solution: Implementing time range partitions.

Each partition will have its own column family set and will be managed by the client.

Suppose that you want a 7-day retention period. Then you can configure the partition size as 1 day and have 7 active partitions at any time, and drop the inactive partitions (older than 7 days). Dropping immediately removes the data from the disk (with the proper cassandra.yaml configuration).
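As a rough sketch of the client-side bookkeeping this implies (hypothetical names; assuming day-granularity partitions identified by date):

```python
from datetime import datetime, timedelta

RETENTION_DAYS = 7  # retention period from the example above

def partition_suffix(ts: datetime) -> str:
    """Day-granularity partition id, e.g. '20130528' for May 28, 2013."""
    return ts.strftime("%Y%m%d")

def active_partitions(now: datetime) -> list[str]:
    """The partitions still inside the retention window, newest first."""
    return [partition_suffix(now - timedelta(days=d)) for d in range(RETENTION_DAYS)]

def is_expired(partition: str, now: datetime) -> bool:
    """A partition outside the window can be dropped, which removes its
    data from disk immediately (given the right cassandra.yaml settings)."""
    return partition not in active_partitions(now)
```

Dropping then amounts to walking the known partitions and dropping every expired one.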

Storing an event:

Find the current partition p1

store the event data to Event_data_cf_p1

store the indexes to timeseries_cf_p1

store the inverted indexes to timeseries_inv_cf_p1
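Those three writes could be sketched as below; the `write(cf, key, value)` helper is purely hypothetical (a stand-in for whatever client call is actually used), as is the event shape:

```python
import uuid

def write(cf: str, key, value) -> None:
    """Hypothetical stand-in for a real Cassandra client write."""
    print(f"{cf}[{key!r}] <- {value!r}")

def store_event(partition: str, blob: bytes, attributes: list[str]) -> str:
    """Route one event and its indexes to the per-partition column families."""
    event_id = str(uuid.uuid4())  # event ids are randomly generated
    write(f"Event_data_cf_{partition}", event_id, blob)
    for index_key in attributes:  # e.g. "location=london"
        write(f"timeseries_cf_{partition}", index_key, event_id)
    # inverted index: event id -> its index keys, used later for deletion
    write(f"timeseries_inv_cf_{partition}", event_id, set(attributes))
    return event_id
```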


A time range query with an index:

Find all the partitions belonging to that time range

Read starting from the first partition until you reach the limit

...
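A sketch of that read path, with a hypothetical `read_index(partition, index_key)` helper standing in for the actual per-partition slice query:

```python
from datetime import datetime, timedelta

def partitions_in_range(start: datetime, end: datetime) -> list[str]:
    """All day partitions touched by [start, end], oldest first."""
    days = (end.date() - start.date()).days
    return [(start + timedelta(days=d)).strftime("%Y%m%d") for d in range(days + 1)]

def range_query(index_key: str, start: datetime, end: datetime,
                limit: int, read_index) -> list[str]:
    """Collect event ids partition by partition until the limit is reached."""
    results: list[str] = []
    for partition in partitions_in_range(start, end):
        results.extend(read_index(partition, index_key))
        if len(results) >= limit:
            break
    return results[:limit]
```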

Could you please provide your comments and concerns?

Is there any other option that we can try?

What do you think about the candidate solution?

Does anyone have the same issue? How would you solve it in another way?


Thanks in advance!

Cem
