Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=from
	:mime-version:content-type:subject:date:in-reply-to:to
	:references:message-id; q=dns; s=thelastpickle.com; b=c/Cypth+vT
	cFjnNuIRz8cnq+wquozbOyjuebIJYyFLbzbV8aAD1/7SIIlz2tn0ze52ebQlxmG/
	83+AKeKU/U64ZPUw8IWAKDbW1O6R9RDNH3pKhSc0pQ4nf/qC/W2MiK9REYgRnomS
	1iQbhJpKEXjtivxJ/lP+zMTwk049z8+XA=
From: aaron morton <aaron@thelastpickle.com>
Mime-Version: 1.0 (Apple Message framework v1244.3)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_26D1CE5C-D91D-4BF5-AB6B-7402BE0306D6"
Subject: Re: Weird problem with empty CF
Date: Wed, 5 Oct 2011 09:34:15 +1300
In-Reply-To: <4E8B3C77.5020202@netseer.com>
To: user@cassandra.apache.org
References: <4E835ADD.2020809@netseer.com>
 <DE3C87ED-41DC-46A9-BF50-0DBADCD5BBF7@thelastpickle.com>
 <CAGiE6h9h1wjNGBR3XH48owQsgoebu91-xy76-wGc9CK5mV_Jrw@mail.gmail.com>
 <F8E9F533-DCF8-40F1-81A2-1199C1D31735@thelastpickle.com>
 <CAGiE6h9bfrLiiaQ1EYRAn876d9vDtAvz8qc41Q99Eo2B8MEyUg@mail.gmail.com>
 <94DC18F0-5EA0-446D-AEE3-E1505E05E157@thelastpickle.com>
 <4E8B3C77.5020202@netseer.com>
Message-Id: <98D1932E-600A-49DB-B55E-93AE4B290206@thelastpickle.com>


--Apple-Mail=_26D1CE5C-D91D-4BF5-AB6B-7402BE0306D6
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

I would not get gc_grace seconds to 0, set to to something small.=20

gc_grace_seconds or ttl is only the minimum amount of time the column =
will stay in the data files. The columns are only purged when compaction =
runs some time after that timespan has ended.=20

If you are seeing issues where a heavy delete workload is having an =
noticeably adverse effect on read performance then you should look at =
the data model. Consider ways to spread the write / read / delete =
workload over multiple rows.

If you cannot get away from it then experiment with reducing the =
min_compactioon_threshold of the CF's so that compaction kicks in =
quicker, and (potentially) tombstones are purged faster.=20

Chees

=20
-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 5/10/2011, at 6:03 AM, Daning wrote:

> Thanks Aaron.  How about I set the gc_grace_seconds to 0 or like 2 =
hours? I like to clean up tomebstone sooner, I don't care losing     =
some data and all my columns have ttl.=20
>=20
> If one node is down longer than gc_grace_seconds, and I got tombstone =
removed, once the node is up, from my understanding deleted data will be =
synced back. In this case my data will be processed twice and it will =
not be a big deal to me.
>=20
> Thanks,
>=20
> Daning
>=20
>=20
> On 10/04/2011 01:27 AM, aaron morton wrote:
>>=20
>> Yes that's the slice query skipping past the tombstone columns.=20
>>=20
>> Cheers
>>=20
>> -----------------
>> Aaron Morton
>> Freelance Cassandra Developer
>> @aaronmorton
>> http://www.thelastpickle.com
>>=20
>> On 4/10/2011, at 4:24 PM, Daning Wang wrote:
>>=20
>>> Lots of SliceQueryFilter in the log, is that handling tombstone?
>>>=20
>>> DEBUG [ReadStage:49] 2011-10-03 20:15:07,942 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317582939743663:true:4@1317582939933000
>>> DEBUG [ReadStage:50] 2011-10-03 20:15:07,942 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317573253148778:true:4@1317573253354000
>>> DEBUG [ReadStage:43] 2011-10-03 20:15:07,942 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317669552951428:true:4@1317669553018000
>>> DEBUG [ReadStage:33] 2011-10-03 20:15:07,942 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317581886709261:true:4@1317581886957000
>>> DEBUG [ReadStage:52] 2011-10-03 20:15:07,942 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317568165152246:true:4@1317568165482000
>>> DEBUG [ReadStage:36] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317567265089211:true:4@1317567265405000
>>> DEBUG [ReadStage:53] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317674324843122:true:4@1317674324946000
>>> DEBUG [ReadStage:38] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317571990078721:true:4@1317571990141000
>>> DEBUG [ReadStage:57] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317671855234221:true:4@1317671855239000
>>> DEBUG [ReadStage:54] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317558305262954:true:4@1317558305337000
>>> DEBUG [RequestResponseStage:11] 2011-10-03 20:15:07,941 =
ResponseVerbHandler.java (line 48) Processing response on a callback =
from 12347@/10.210.101.104
>>> DEBUG [RequestResponseStage:9] 2011-10-03 20:15:07,941 =
AbstractRowResolver.java (line 66) Preprocessed data response
>>> DEBUG [RequestResponseStage:13] 2011-10-03 20:15:07,941 =
AbstractRowResolver.java (line 66) Preprocessed digest response
>>> DEBUG [ReadStage:58] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317581337972739:true:4@1317581338044000
>>> DEBUG [ReadStage:64] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317582656796332:true:4@1317582656970000
>>> DEBUG [ReadStage:55] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317569432886284:true:4@1317569432984000
>>> DEBUG [ReadStage:45] 2011-10-03 20:15:07,941 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317572658687019:true:4@1317572658718000
>>> DEBUG [ReadStage:47] 2011-10-03 20:15:07,940 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317582281617755:true:4@1317582281717000
>>> DEBUG [ReadStage:48] 2011-10-03 20:15:07,940 SliceQueryFilter.java =
(line 123) collecting 0 of 1: 1317549607869226:true:4@1317549608118000
>>> DEBUG [ReadStage:34] 2011-10-03 20:15:07,940 SliceQueryFilter.java =
(line 123) collecting 0 of 1:=20
>>> On Thu, Sep 29, 2011 at 2:17 PM, aaron morton =
<aaron@thelastpickle.com> wrote:
>>> As with any situation involving the un-dead, it really is the number =
of Zombies, Mummies or Vampires that is the concern. =20
>>>=20
>>> If you delete data there will always be tombstones. If you have a =
delete heavy workload there will be more tombstones. This is why =
implementing a queue with cassandra is a bad idea.
>>>=20
>>> gc_grace_seconds (and column TTL) are the *minimum* about of time =
the tombstones will stay in the data files, there is no maximum.=20
>>>=20
>>> Your read performance also depends on the number of SSTables the row =
is spread over, see =
http://thelastpickle.com/2011/04/28/Forces-of-Write-and-Read/
>>>=20
>>> If you really wanted to purge them then yes a repair and then major =
compaction would be the way to go. Also consider if it's possible to =
design the data model around the problem, e.g. partitioning rows by =
date. IMHO I would look to make data model changes before implementing a =
compaction policy, or consider if cassandra is the right store if you =
have a delete heavy workload.
>>>=20
>>> Cheers
>>>=20
>>> =20
>>> -----------------
>>> Aaron Morton
>>> Freelance Cassandra Developer
>>> @aaronmorton
>>> http://www.thelastpickle.com
>>>=20
>>> On 30/09/2011, at 3:27 AM, Daning Wang wrote:
>>>=20
>>>> Jonathan/Aaron,
>>>>=20
>>>> Thank you guy's reply, I will change GCGracePeriod to 1 day to see =
what will happen.
>>>>=20
>>>> Is there a way to purge tombstones at anytime? because if =
tombstones affect performance, we want them to be purged right away, not =
after GCGracePeriod. We know all the nodes are up, and we can do repair =
first to make sure the consistency before purging.
>>>>=20
>>>> Thanks,
>>>>=20
>>>> Daning
>>>>=20
>>>>=20
>>>> On Wed, Sep 28, 2011 at 5:22 PM, aaron morton =
<aaron@thelastpickle.com> wrote:
>>>> if I had to guess I would say it was spending time handling =
tombstones. If you see it happen again, and are interested, turn the =
logging up to DEBUG and look for messages from something starting with =
"Slice"
>>>>=20
>>>> Minor (automatic) compaction will, over time, purge the tombstones. =
Until then reads must read discard the data deleted by the tombstones. =
If you perform a big (i.e. 100k's ) delete this can reduce performance =
until compaction does it's thing.
>>>>=20
>>>> My second guess would be read repair (or the simple consistency =
checks on read) kicking in. That would show up in the "ReadRepairStage" =
in TPSTATS
>>>>=20
>>>> it may have been neither of those two things, just guesses. If you =
have more issues let us know and provide some more info.
>>>>=20
>>>> Cheers
>>>>=20
>>>>=20
>>>> -----------------
>>>> Aaron Morton
>>>> Freelance Cassandra Developer
>>>> @aaronmorton
>>>> http://www.thelastpickle.com
>>>>=20
>>>> On 29/09/2011, at 6:35 AM, Daning wrote:
>>>>=20
>>>> > I have an app polling a few CFs (select first N * from CF), there =
were data in CFs but later were deleted so CFs were empty for a long =
time. I found Cassandra CPU usage was getting high to 80%, normally it =
uses less than 30%. I issued the select query manually and feel the =
response is slow. I have tried nodetool compact/repair for those CFs but =
that does not work. later, I issue 'truncate' for all the CFs and CPU =
usage gets down to 1%.
>>>> >
>>>> > Can somebody explain to me why I need to truncate an empty CF? =
and what else I could do to bring the CPU usage down?
>>>> >
>>>> > I am running 0.8.6.
>>>> >
>>>> > Thanks,
>>>> >
>>>> > Daning
>>>> >
>>>>=20
>>>>=20
>>>=20
>>>=20
>>=20
>=20


--Apple-Mail=_26D1CE5C-D91D-4BF5-AB6B-7402BE0306D6
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=iso-8859-1

<html><head></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">I =
would not get gc_grace seconds to 0, set to to something =
small.&nbsp;<div><br></div><div>gc_grace_seconds or ttl is only the =
minimum amount of time the column will stay in the data files. The =
columns are only purged when compaction runs some time after that =
timespan has ended.&nbsp;</div><div><br></div><div>If you are seeing =
issues where a heavy delete workload is having an noticeably adverse =
effect on read performance then you should look at the data model. =
Consider ways to spread the write / read / delete workload over multiple =
rows.</div><div><br></div><div>If you cannot get away from it then =
experiment with reducing the min_compactioon_threshold of the CF's so =
that compaction kicks in quicker, and (potentially) tombstones are =
purged =
faster.&nbsp;</div><div><br></div><div>Chees</div><div><br></div><div>&nbs=
p;<br><div>
<span class=3D"Apple-style-span" style=3D"border-collapse: separate; =
color: rgb(0, 0, 0); font-family: Helvetica; font-style: normal; =
font-variant: normal; font-weight: normal; letter-spacing: normal; =
line-height: normal; orphans: 2; text-align: auto; text-indent: 0px; =
text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; =
-webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: =
0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: =
0px; -webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Cassandra Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></div></span></div></span></span>
</div>

<br><div><div>On 5/10/2011, at 6:03 AM, Daning wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite">

 =20
    <meta content=3D"text/html; charset=3DISO-8859-1" =
http-equiv=3D"Content-Type">
 =20
  <div text=3D"#000000" bgcolor=3D"#ffffff">
    Thanks Aaron.&nbsp; How about I set the gc_grace_seconds to 0 or =
like 2
    hours? I like to clean up tomebstone sooner, I don't care losing
    some data and all my columns have ttl. <br>
    <br>
    If one node is down longer than gc_grace_seconds, and I got
    tombstone removed, once the node is up, from my understanding
    deleted data will be synced back. In this case my data will be
    processed twice and it will not be a big deal to me.<br>
    <br>
    Thanks,<br>
    <br>
    Daning<br>
    <br>
    <br>
    On 10/04/2011 01:27 AM, aaron morton wrote:
    <blockquote =
cite=3D"mid:94DC18F0-5EA0-446D-AEE3-E1505E05E157@thelastpickle.com" =
type=3D"cite">Yes that's the slice query skipping past the tombstone
      columns.&nbsp;
      <div><br>
      </div>
      <div>Cheers</div>
      <div><br>
        <div>
          <span class=3D"Apple-style-span" style=3D"border-collapse: =
separate; font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; font-size: medium; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; =
font-family: Helvetica; font-style: normal; font-variant: normal; =
font-weight: normal; letter-spacing: normal; line-height: normal; =
orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; =
widows: 2; word-spacing: 0px; font-size: medium; ">
              <div style=3D"word-wrap: break-word;"><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; =
font-family: Helvetica; font-style: normal; font-variant: normal; =
font-weight: normal; letter-spacing: normal; line-height: normal; =
orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; =
widows: 2; word-spacing: 0px; font-size: medium; ">
                  <div style=3D"word-wrap: break-word;">
                    <div>
                      <div>-----------------</div>
                      <div>Aaron Morton</div>
                      <div>Freelance Cassandra Developer</div>
                      <div>@aaronmorton</div>
                      <div><a moz-do-not-send=3D"true" =
href=3D"http://www.thelastpickle.com/">http://www.thelastpickle.com</a></d=
iv>
                    </div>
                  </div>
                </span></div>
            </span></span>
        </div>
        <br>
        <div>
          <div>On 4/10/2011, at 4:24 PM, Daning Wang wrote:</div>
          <br class=3D"Apple-interchange-newline">
          <blockquote type=3D"cite">Lots of SliceQueryFilter in the log,
            is that handling tombstone?<br>
            <br>
            DEBUG [ReadStage:49] 2011-10-03 20:15:07,942
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317582939743663:true:4@1317582939933000<br>
            DEBUG [ReadStage:50] 2011-10-03 20:15:07,942
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317573253148778:true:4@1317573253354000<br>
            DEBUG [ReadStage:43] 2011-10-03 20:15:07,942
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317669552951428:true:4@1317669553018000<br>
            DEBUG [ReadStage:33] 2011-10-03 20:15:07,942
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317581886709261:true:4@1317581886957000<br>
            DEBUG [ReadStage:52] 2011-10-03 20:15:07,942
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317568165152246:true:4@1317568165482000<br>
            DEBUG [ReadStage:36] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317567265089211:true:4@1317567265405000<br>
            DEBUG [ReadStage:53] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317674324843122:true:4@1317674324946000<br>
            DEBUG [ReadStage:38] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317571990078721:true:4@1317571990141000<br>
            DEBUG [ReadStage:57] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317671855234221:true:4@1317671855239000<br>
            DEBUG [ReadStage:54] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317558305262954:true:4@1317558305337000<br>
            DEBUG [RequestResponseStage:11] 2011-10-03 20:15:07,941
            ResponseVerbHandler.java (line 48) Processing response on a
            callback from 12347@/<a moz-do-not-send=3D"true" =
href=3D"http://10.210.101.104/">10.210.101.104</a><br>
            DEBUG [RequestResponseStage:9] 2011-10-03 20:15:07,941
            AbstractRowResolver.java (line 66) Preprocessed data
            response<br>
            DEBUG [RequestResponseStage:13] 2011-10-03 20:15:07,941
            AbstractRowResolver.java (line 66) Preprocessed digest
            response<br>
            DEBUG [ReadStage:58] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317581337972739:true:4@1317581338044000<br>
            DEBUG [ReadStage:64] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317582656796332:true:4@1317582656970000<br>
            DEBUG [ReadStage:55] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317569432886284:true:4@1317569432984000<br>
            DEBUG [ReadStage:45] 2011-10-03 20:15:07,941
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317572658687019:true:4@1317572658718000<br>
            DEBUG [ReadStage:47] 2011-10-03 20:15:07,940
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317582281617755:true:4@1317582281717000<br>
            DEBUG [ReadStage:48] 2011-10-03 20:15:07,940
            SliceQueryFilter.java (line 123) collecting 0 of 1:
            1317549607869226:true:4@1317549608118000<br>
            DEBUG [ReadStage:34] 2011-10-03 20:15:07,940
            SliceQueryFilter.java (line 123) collecting 0 of 1: <br>
            <div class=3D"gmail_quote">On Thu, Sep 29, 2011 at 2:17 PM,
              aaron morton <span dir=3D"ltr">&lt;<a =
moz-do-not-send=3D"true" =
href=3D"mailto:aaron@thelastpickle.com">aaron@thelastpickle.com</a>&gt;</s=
pan>
              wrote:<br>
              <blockquote class=3D"gmail_quote" style=3D"border-left: =
1px
                solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex;
                padding-left: 1ex;">
                <div style=3D"word-wrap: break-word;">As with any
                  situation involving the un-dead, it really is the
                  number of Zombies, Mummies or Vampires that is the
                  concern. &nbsp;
                  <div><br>
                  </div>
                  <div>If you delete data there will always be
                    tombstones. If you have a delete heavy workload
                    there will be more tombstones. This is why
                    implementing a queue with cassandra is a bad =
idea.</div>
                  <div><br>
                  </div>
                  <div>gc_grace_seconds (and column TTL) are the
                    *minimum* about of time the tombstones will stay in
                    the data files, there is no maximum.&nbsp;</div>
                  <div><br>
                  </div>
                  <div>Your read performance also depends on the number
                    of SSTables the row is spread over, see&nbsp;<a =
moz-do-not-send=3D"true" =
href=3D"http://thelastpickle.com/2011/04/28/Forces-of-Write-and-Read/" =
target=3D"_blank">http://thelastpickle.com/2011/04/28/Forces-of-Write-and-=
Read/</a></div>
                  <div><br>
                  </div>
                  <div>If you really wanted to purge them then yes a
                    repair and then major compaction would be the way to
                    go. Also consider if it's possible to design the
                    data model around the problem, e.g. partitioning
                    rows by date. IMHO I would look to make data model
                    changes before implementing a compaction policy, or
                    consider if cassandra is the right store if you have
                    a delete heavy workload.</div>
                  <div><br>
                  </div>
                  <div>Cheers</div>
                  <div><br>
                  </div>
                  <div>
                    <div class=3D"im">&nbsp;<br>
                      <div>
                        <span style=3D"border-collapse: separate; color:
                          rgb(0, 0, 0); font-family: Helvetica;
                          font-style: normal; font-variant: normal;
                          font-weight: normal; letter-spacing: normal;
                          line-height: normal; text-indent: 0px;
                          text-transform: none; white-space: normal;
                          word-spacing: 0px; font-size: medium;"><span =
style=3D"border-collapse: separate; color:
                            rgb(0, 0, 0); font-family: Helvetica;
                            font-style: normal; font-variant: normal;
                            font-weight: normal; letter-spacing: normal;
                            line-height: normal; text-indent: 0px;
                            text-transform: none; white-space: normal;
                            word-spacing: 0px; font-size: medium;">
                            <div style=3D"word-wrap: break-word;">
                              <span style=3D"border-collapse: separate;
                                color: rgb(0, 0, 0); font-family:
                                Helvetica; font-style: normal;
                                font-variant: normal; font-weight:
                                normal; letter-spacing: normal;
                                line-height: normal; text-indent: 0px;
                                text-transform: none; white-space:
                                normal; word-spacing: 0px; font-size:
                                medium;">
                                <div style=3D"word-wrap: break-word;">
                                  <div>
                                    <div>-----------------</div>
                                    <div>Aaron Morton</div>
                                    <div>Freelance Cassandra =
Developer</div>
                                    <div>@aaronmorton</div>
                                    <div><a moz-do-not-send=3D"true" =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a></div>
                                  </div>
                                </div>
                              </span></div>
                          </span></span>
                      </div>
                      <br>
                    </div>
                    <div>
                      <div class=3D"h5">
                        <div>
                          <div>On 30/09/2011, at 3:27 AM, Daning Wang
                            wrote:</div>
                          <br>
                          <blockquote type=3D"cite">Jonathan/Aaron,
                            <div><br>
                            </div>
                            <div>Thank you guy's reply, I will change
                              GCGracePeriod to 1 day to see what will
                              happen.</div>
                            <div><br>
                            </div>
                            <div>Is there a way to purge tombstones at
                              anytime? because if tombstones affect
                              performance, we want them to be purged
                              right away, not after GCGracePeriod. We
                              know all the nodes are up, and we can do
                              repair first to make sure the consistency
                              before purging.</div>
                            <div><br>
                            </div>
                            <div>Thanks,</div>
                            <div><br>
                            </div>
                            <div>Daning</div>
                            <div><br>
                              <br>
                              <div class=3D"gmail_quote">On Wed, Sep 28,
                                2011 at 5:22 PM, aaron morton <span =
dir=3D"ltr">&lt;<a moz-do-not-send=3D"true" =
href=3D"mailto:aaron@thelastpickle.com" =
target=3D"_blank">aaron@thelastpickle.com</a>&gt;</span>
                                wrote:<br>
                                <blockquote class=3D"gmail_quote" =
style=3D"border-left: 1px solid rgb(204,
                                  204, 204); margin: 0pt 0pt 0pt 0.8ex;
                                  padding-left: 1ex;">if I had to guess
                                  I would say it was spending time
                                  handling tombstones. If you see it
                                  happen again, and are interested, turn
                                  the logging up to DEBUG and look for
                                  messages from something starting with
                                  "Slice"<br>
                                  <br>
                                  Minor (automatic) compaction will,
                                  over time, purge the tombstones. Until
                                  then reads must read discard the data
                                  deleted by the tombstones. If you
                                  perform a big (i.e. 100k's ) delete
                                  this can reduce performance until
                                  compaction does it's thing.<br>
                                  <br>
                                  My second guess would be read repair
                                  (or the simple consistency checks on
                                  read) kicking in. That would show up
                                  in the "ReadRepairStage" in =
TPSTATS<br>
                                  <br>
                                  it may have been neither of those two
                                  things, just guesses. If you have more
                                  issues let us know and provide some
                                  more info.<br>
                                  <br>
                                  Cheers<br>
                                  <br>
                                  <br>
                                  -----------------<br>
                                  <font color=3D"#888888">Aaron =
Morton<br>
                                    Freelance Cassandra Developer<br>
                                    @aaronmorton<br>
                                    <a moz-do-not-send=3D"true" =
href=3D"http://www.thelastpickle.com/" =
target=3D"_blank">http://www.thelastpickle.com</a><br>
                                  </font>
                                  <div>
                                    <div><br>
                                      On 29/09/2011, at 6:35 AM, Daning
                                      wrote:<br>
                                      <br>
                                      &gt; I have an app polling a few
                                      CFs (select first N * from CF),
                                      there were data in CFs but later
                                      were deleted so CFs were empty for
                                      a long time. I found Cassandra CPU
                                      usage was getting high to 80%,
                                      normally it uses less than 30%. I
                                      issued the select query manually
                                      and feel the response is slow. I
                                      have tried nodetool compact/repair
                                      for those CFs but that does not
                                      work. later, I issue 'truncate'
                                      for all the CFs and CPU usage gets
                                      down to 1%.<br>
                                      &gt;<br>
                                      &gt; Can somebody explain to me
                                      why I need to truncate an empty
                                      CF? and what else I could do to
                                      bring the CPU usage down?<br>
                                      &gt;<br>
                                      &gt; I am running 0.8.6.<br>
                                      &gt;<br>
                                      &gt; Thanks,<br>
                                      &gt;<br>
                                      &gt; Daning<br>
                                      &gt;<br>
                                      <br>
                                    </div>
                                  </div>
                                </blockquote>
                              </div>
                              <br>
                            </div>
                          </blockquote>
                        </div>
                        <br>
                      </div>
                    </div>
                  </div>
                </div>
              </blockquote>
            </div>
            <br>
          </blockquote>
        </div>
        <br>
      </div>
    </blockquote>
    <br>
  </div>

</blockquote></div><br></div></body></html>=

--Apple-Mail=_26D1CE5C-D91D-4BF5-AB6B-7402BE0306D6--