From: aaron morton
To: user@cassandra.apache.org
Subject: Re: Cassandra and massive TTL expirations cause HEAP issue
Date: Tue, 3 Jul 2012 11:33:34 +1200

> After 10 days my cluster crashes due to a java.lang.OutOfMemoryError during compaction of the big column family that contains roughly 95% of the data.

Does this column family have very wide rows?

> simply some tweaks I need to make in the yaml file. I have tried:

The main things that reduce the impact compaction has on memory are:

  in_memory_compaction_limit_in_mb
  concurrent_compactors
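A minimal cassandra.yaml sketch of those two settings (the values below are only illustrative assumptions, not recommendations; tune them against your heap size and workload):

  # cassandra.yaml (illustrative values only)
  # Rows larger than this limit are compacted with the slower two-pass
  # on-disk path instead of being built entirely in memory.
  in_memory_compaction_limit_in_mb: 32
  # Cap how many compactions run in parallel, which limits how much
  # compaction data is held in memory at once.
  concurrent_compactors: 2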
Off the top of my head I cannot think of any shortcuts taken by compaction if/when all the data in an SSTable is past its TTL. I think there was talk of something like that though.

Hope that helps.

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 27/06/2012, at 2:38 AM, Nils Pommerien wrote:

> Hello,
> I am evaluating Cassandra in a log retrieval application. My ring consists of 3 m2.xlarge instances (17.1 GB memory, 6.5 ECU (2 virtual cores with 3.25 EC2 Compute Units each), 420 GB of local instance storage, 64-bit platform) and I am writing at roughly 220 writes/sec. Per day I am adding roughly 60GB of data. All of this sounds simple and easy, and all three nodes are humming along with basically no load.
>
> The issue is that I am writing all my data with a TTL of 10 days. After 10 days my cluster crashes due to a java.lang.OutOfMemoryError during compaction of the big column family that contains roughly 95% of the data. So basically after 10 days my data set is 600GB, and from then on Cassandra would have to tombstone and purge 60GB of data at the same rate of roughly 220 deletes/second. I am not sure if Cassandra should be able to do this, whether I should take a partitioning approach (one CF per day), or if there are simply some tweaks I need to make in the yaml file. I have tried:
> 1. Decreasing flush_largest_memtables_at to .4
> 2. Setting reduce_cache_sizes_at and reduce_cache_capacity_to to 1
> Now, the issue remains the same:
>
> WARN [ScheduledTasks:1] 2012-06-11 19:39:42,017 GCInspector.java (line 145) Heap is 0.9920103380107628 full. You may need to reduce memtable and/or cache sizes. Cassandra will now flush up to the two largest memtables to free up memory. Adjust flush_largest_memtables_at threshold in cassandra.yaml if you don't want Cassandra to do this automatically.
>
> Eventually it will just die with this message. This affects all nodes in the cluster, not just one.
>
> Dump file is incomplete: file size limit
> ERROR 19:39:39,695 Exception in thread Thread[ReadStage:134,5,main]
> java.lang.OutOfMemoryError: Java heap space
> ERROR 19:39:39,724 Exception in thread Thread[MutationStage:57,5,main]
> java.lang.OutOfMemoryError: Java heap space
>         at org.apache.cassandra.utils.FBUtilities.hashToBigInteger(FBUtilities.java:213)
>         at org.apache.cassandra.dht.RandomPartitioner.getToken(RandomPartitioner.java:154)
>         at org.apache.cassandra.dht.RandomPartitioner.decorateKey(RandomPartitioner.java:47)
>         at org.apache.cassandra.db.RowPosition.forKey(RowPosition.java:54)
>
> Any help is highly appreciated. It would be cool to tweak it in a way that I can have a moving window of 10 days in Cassandra while dropping the old data… Or, if there is any other recommended way to deal with such sliding time windows, I am open to ideas.
>
> Thank you for your help!