Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of shimi.k@gmail.com designates
 209.85.213.172 as permitted sender)
DomainKey-Signature: a=rsa-sha1; c=nofws;
        d=gmail.com; s=gamma;
        h=mime-version:in-reply-to:references:date:message-id:subject:from:to
         :content-type;
        b=muW5+iYA6k9fiWewKJ4nfts/iHrfO49Sx1t8K1bM2IQw7cVmpSEQjAH9lcqd1F34os
         7piuIxuG8RVsRIjIpvfdoPG/ej/lQOY4p8VVS0L1YYPretw0ETxAz+ph3UQbYqy7g8u3
         8pJZtCVo2vt3SRj2WluJ9Vfpa2d5AVPLjUZDA=
MIME-Version: 1.0
In-Reply-To: <AANLkTikup28d-PXRmTn3u4TcrCVhePc2P9MVmZV7C-M5@mail.gmail.com>
References: <AANLkTinBMDEDjcL728vGoshSBooCteaVEPiqj_+DKhGv@mail.gmail.com>
	<AANLkTi=1ECuib8RSqWdX2EB6+mpZeNYW5922h9VyAEj4@mail.gmail.com>
	<AANLkTinW6Ufd3tjiRxyrbzi1iN86GjJczN6mWiYwF1zg@mail.gmail.com>
	<AANLkTinS-gtB+7g9ZV6vnSKmXdUPX9X-ifstWrqQh_UA@mail.gmail.com>
	<AANLkTimZ3+2-KnvWAZUsbotOG4BSB0zq5ZNL1w2gNADG@mail.gmail.com>
	<AANLkTin7qUstmSB6EM88wn4RW8raXO45pPzoJmKk-Nk+@mail.gmail.com>
	<AANLkTikG2owt1SE74d4RiCJHyAo-u0NdDO9as-uPu72X@mail.gmail.com>
	<AANLkTim8x8MUZjbGndbDEwrbJUiFSAOGyhFrYeSVMyaK@mail.gmail.com>
	<AANLkTimhMLnaof1ze0a1f5XfJk+7hHPFHdON=JEFT35Y@mail.gmail.com>
	<AANLkTim7M5VSJN+gE-sPOXGwrj2NMzB8WSSUkR7QxkVe@mail.gmail.com>
	<AANLkTi=wcfbFFSBCK_Mrujss1CBdP_K=e_AS3VM_4+vf@mail.gmail.com>
	<AANLkTikup28d-PXRmTn3u4TcrCVhePc2P9MVmZV7C-M5@mail.gmail.com>
Date: Mon, 10 Jan 2011 16:00:14 +0200
Message-ID: <AANLkTikxRSLVmGpv6uvaT-5aKjH1dJ42EaGG_B_jAWTo@mail.gmail.com>
Subject: Re: Reclaim deleted rows space
From: shimi <shimi.k@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=0016363b80ae18ffc704997e636b

--0016363b80ae18ffc704997e636b
Content-Type: text/plain; charset=ISO-8859-1

I modified the code to limit the size of the SSTables.
I will be glad if someone can take a look at it

https://github.com/Shimi/cassandra/tree/cassandra-0.6

<https://github.com/Shimi/cassandra/tree/cassandra-0.6>Shimi

On Fri, Jan 7, 2011 at 2:04 AM, Jonathan Shook <jshook@gmail.com> wrote:

> I believe the following condition within submitMinorIfNeeded(...)
> determines whether to continue, so it's not a hard loop.
>
> // if (sstables.size() >= minThreshold) ...
>
>
>
> On Thu, Jan 6, 2011 at 2:51 AM, shimi <shimi.k@gmail.com> wrote:
> > According to the code it make sense.
> > submitMinorIfNeeded() calls doCompaction() which
> > calls submitMinorIfNeeded().
> > With minimumCompactionThreshold = 1 submitMinorIfNeeded() will always run
> > compaction.
> >
> > Shimi
> > On Thu, Jan 6, 2011 at 10:26 AM, shimi <shimi.k@gmail.com> wrote:
> >>
> >>
> >> On Wed, Jan 5, 2011 at 11:31 PM, Jonathan Ellis <jbellis@gmail.com>
> wrote:
> >>>
> >>> Pretty sure there's logic in there that says "don't bother compacting
> >>> a single sstable."
> >>
> >> No. You can do it.
> >> Based on the log I have a feeling that it triggers an infinite
> compaction
> >> loop.
> >>
> >>>
> >>> On Wed, Jan 5, 2011 at 2:26 PM, shimi <shimi.k@gmail.com> wrote:
> >>> > How does minor compaction is triggered? Is it triggered Only when a
> new
> >>> > SStable is added?
> >>> >
> >>> > I was wondering if triggering a compaction
> >>> > with minimumCompactionThreshold
> >>> > set to 1 would be useful. If this can happen I assume it will do
> >>> > compaction
> >>> > on files with similar size and remove deleted rows on the rest.
> >>> > Shimi
> >>> > On Tue, Jan 4, 2011 at 9:56 PM, Peter Schuller
> >>> > <peter.schuller@infidyne.com>
> >>> > wrote:
> >>> >>
> >>> >> > I don't have a problem with disk space. I have a problem with the
> >>> >> > data
> >>> >> > size.
> >>> >>
> >>> >> [snip]
> >>> >>
> >>> >> > Bottom line is that I want to reduce the number of requests that
> >>> >> > goes to
> >>> >> > disk. Since there is enough data that is no longer valid I can do
> it
> >>> >> > by
> >>> >> > reclaiming the space. The only way to do it is by running Major
> >>> >> > compaction.
> >>> >> > I can wait and let Cassandra do it for me but then the data size
> >>> >> > will
> >>> >> > get
> >>> >> > even bigger and the response time will be worst. I can do it
> >>> >> > manually
> >>> >> > but I
> >>> >> > prefer it to happen in the background with less impact on the
> system
> >>> >>
> >>> >> Ok - that makes perfect sense then. Sorry for misunderstanding :)
> >>> >>
> >>> >> So essentially, for workloads that are teetering on the edge of
> cache
> >>> >> warmness and is subject to significant overwrites or removals, it
> may
> >>> >> be beneficial to perform much more aggressive background compaction
> >>> >> even though it might waste lots of CPU, to keep the in-memory
> working
> >>> >> set down.
> >>> >>
> >>> >> There was talk (I think in the compaction redesign ticket) about
> >>> >> potentially improving the use of bloom filters such that obsolete
> data
> >>> >> in sstables could be eliminated from the read set without
> >>> >> necessitating actual compaction; that might help address cases like
> >>> >> these too.
> >>> >>
> >>> >> I don't think there's a pre-existing silver bullet in a current
> >>> >> release; you probably have to live with the need for
> >>> >> greater-than-theoretically-optimal memory requirements to keep the
> >>> >> working set in memory.
> >>> >>
> >>> >> --
> >>> >> / Peter Schuller
> >>> >
> >>> >
> >>>
> >>>
> >>>
> >>> --
> >>> Jonathan Ellis
> >>> Project Chair, Apache Cassandra
> >>> co-founder of Riptano, the source for professional Cassandra support
> >>> http://riptano.com
> >>
> >
> >
>

--0016363b80ae18ffc704997e636b
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">I modified the code to limit the size of the SSTables.<div=
>I will be glad if someone can take a look at it</div><div><br></div><div><=
meta http-equiv=3D"content-type" content=3D"text/html; charset=3Dutf-8"><a =
href=3D"https://github.com/Shimi/cassandra/tree/cassandra-0.6">https://gith=
ub.com/Shimi/cassandra/tree/cassandra-0.6</a></div>
<div><br></div><div><a href=3D"https://github.com/Shimi/cassandra/tree/cass=
andra-0.6"></a>Shimi<br><br><div class=3D"gmail_quote">On Fri, Jan 7, 2011 =
at 2:04 AM, Jonathan Shook <span dir=3D"ltr">&lt;<a href=3D"mailto:jshook@g=
mail.com">jshook@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex;">I believe the following condition within su=
bmitMinorIfNeeded(...)<br>
determines whether to continue, so it&#39;s not a hard loop.<br>
<br>
// if (sstables.size() &gt;=3D minThreshold) ...<br>
<div><div></div><div class=3D"h5"><br>
<br>
<br>
On Thu, Jan 6, 2011 at 2:51 AM, shimi &lt;<a href=3D"mailto:shimi.k@gmail.c=
om">shimi.k@gmail.com</a>&gt; wrote:<br>
&gt; According to the code it make sense.<br>
&gt; submitMinorIfNeeded() calls doCompaction() which<br>
&gt; calls=A0submitMinorIfNeeded().<br>
&gt; With=A0minimumCompactionThreshold =3D 1=A0submitMinorIfNeeded() will a=
lways run<br>
&gt; compaction.<br>
&gt;<br>
&gt; Shimi<br>
&gt; On Thu, Jan 6, 2011 at 10:26 AM, shimi &lt;<a href=3D"mailto:shimi.k@g=
mail.com">shimi.k@gmail.com</a>&gt; wrote:<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;&gt; On Wed, Jan 5, 2011 at 11:31 PM, Jonathan Ellis &lt;<a href=3D"mai=
lto:jbellis@gmail.com">jbellis@gmail.com</a>&gt; wrote:<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Pretty sure there&#39;s logic in there that says &quot;don&#39=
;t bother compacting<br>
&gt;&gt;&gt; a single sstable.&quot;<br>
&gt;&gt;<br>
&gt;&gt; No. You can do it.<br>
&gt;&gt; Based on the log I have a feeling that it triggers an infinite com=
paction<br>
&gt;&gt; loop.<br>
&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; On Wed, Jan 5, 2011 at 2:26 PM, shimi &lt;<a href=3D"mailto:sh=
imi.k@gmail.com">shimi.k@gmail.com</a>&gt; wrote:<br>
&gt;&gt;&gt; &gt; How does minor compaction is triggered? Is it triggered O=
nly when a new<br>
&gt;&gt;&gt; &gt; SStable is added?<br>
&gt;&gt;&gt; &gt;<br>
&gt;&gt;&gt; &gt; I was wondering if triggering a compaction<br>
&gt;&gt;&gt; &gt; with=A0minimumCompactionThreshold<br>
&gt;&gt;&gt; &gt; set to 1 would be useful. If this can happen I assume it =
will do<br>
&gt;&gt;&gt; &gt; compaction<br>
&gt;&gt;&gt; &gt; on files with similar size and remove deleted rows on the=
 rest.<br>
&gt;&gt;&gt; &gt; Shimi<br>
&gt;&gt;&gt; &gt; On Tue, Jan 4, 2011 at 9:56 PM, Peter Schuller<br>
&gt;&gt;&gt; &gt; &lt;<a href=3D"mailto:peter.schuller@infidyne.com">peter.=
schuller@infidyne.com</a>&gt;<br>
&gt;&gt;&gt; &gt; wrote:<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; &gt; I don&#39;t have a problem with disk space. I ha=
ve a problem with the<br>
&gt;&gt;&gt; &gt;&gt; &gt; data<br>
&gt;&gt;&gt; &gt;&gt; &gt; size.<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; [snip]<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; &gt; Bottom line is that I want to reduce the number =
of requests that<br>
&gt;&gt;&gt; &gt;&gt; &gt; goes to<br>
&gt;&gt;&gt; &gt;&gt; &gt; disk. Since there is enough data that is no long=
er valid=A0I can do it<br>
&gt;&gt;&gt; &gt;&gt; &gt; by<br>
&gt;&gt;&gt; &gt;&gt; &gt; reclaiming the space. The only way to do it is b=
y running Major<br>
&gt;&gt;&gt; &gt;&gt; &gt; compaction.<br>
&gt;&gt;&gt; &gt;&gt; &gt; I can wait and let Cassandra do it for me but th=
en the data size<br>
&gt;&gt;&gt; &gt;&gt; &gt; will<br>
&gt;&gt;&gt; &gt;&gt; &gt; get<br>
&gt;&gt;&gt; &gt;&gt; &gt; even bigger and the response time will be worst.=
 I can do it<br>
&gt;&gt;&gt; &gt;&gt; &gt; manually<br>
&gt;&gt;&gt; &gt;&gt; &gt; but I<br>
&gt;&gt;&gt; &gt;&gt; &gt; prefer it to happen in the background with less =
impact on the system<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; Ok - that makes perfect sense then. Sorry for misunde=
rstanding :)<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; So essentially, for workloads that are teetering on t=
he edge of cache<br>
&gt;&gt;&gt; &gt;&gt; warmness and is subject to significant overwrites or =
removals, it may<br>
&gt;&gt;&gt; &gt;&gt; be beneficial to perform much more aggressive backgro=
und compaction<br>
&gt;&gt;&gt; &gt;&gt; even though it might waste lots of CPU, to keep the i=
n-memory working<br>
&gt;&gt;&gt; &gt;&gt; set down.<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; There was talk (I think in the compaction redesign ti=
cket) about<br>
&gt;&gt;&gt; &gt;&gt; potentially improving the use of bloom filters such t=
hat obsolete data<br>
&gt;&gt;&gt; &gt;&gt; in sstables could be eliminated from the read set wit=
hout<br>
&gt;&gt;&gt; &gt;&gt; necessitating actual compaction; that might help addr=
ess cases like<br>
&gt;&gt;&gt; &gt;&gt; these too.<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; I don&#39;t think there&#39;s a pre-existing silver b=
ullet in a current<br>
&gt;&gt;&gt; &gt;&gt; release; you probably have to live with the need for<=
br>
&gt;&gt;&gt; &gt;&gt; greater-than-theoretically-optimal memory requirement=
s to keep the<br>
&gt;&gt;&gt; &gt;&gt; working set in memory.<br>
&gt;&gt;&gt; &gt;&gt;<br>
&gt;&gt;&gt; &gt;&gt; --<br>
&gt;&gt;&gt; &gt;&gt; / Peter Schuller<br>
&gt;&gt;&gt; &gt;<br>
&gt;&gt;&gt; &gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; --<br>
&gt;&gt;&gt; Jonathan Ellis<br>
&gt;&gt;&gt; Project Chair, Apache Cassandra<br>
&gt;&gt;&gt; co-founder of Riptano, the source for professional Cassandra s=
upport<br>
&gt;&gt;&gt; <a href=3D"http://riptano.com" target=3D"_blank">http://riptan=
o.com</a><br>
&gt;&gt;<br>
&gt;<br>
&gt;<br>
</div></div></blockquote></div><br></div></div>

--0016363b80ae18ffc704997e636b--