Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7D1FF100C5 for ; Fri, 25 Oct 2013 20:22:54 +0000 (UTC) Received: (qmail 28543 invoked by uid 500); 25 Oct 2013 20:21:12 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 28332 invoked by uid 500); 25 Oct 2013 20:21:04 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 28316 invoked by uid 99); 25 Oct 2013 20:21:00 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Oct 2013 20:21:00 +0000 X-ASF-Spam-Status: No, hits=1.7 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dsjas297@gmail.com designates 74.125.82.172 as permitted sender) Received: from [74.125.82.172] (HELO mail-we0-f172.google.com) (74.125.82.172) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Oct 2013 20:20:54 +0000 Received: by mail-we0-f172.google.com with SMTP id q58so4333580wes.17 for ; Fri, 25 Oct 2013 13:20:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=R8klWyLmpvexrxSRUq6Jdyrq5RLU5qOJ2izA++R/nKE=; b=yrSxQ/fEWGCNvXwULRjPlNp1WneXUmVVkEzOr+fzYHNKzRlnvD2Htoeuzzq4eOE0eB t4G40mNa+YAvnC3QPrIw8e5dFTjr5yTQ5YNjO/qpwQEWIttJ3idxGmaWRqk8pawim5tT tbcy22eBEMzpEXRYFDeNfk2wUiVGbyZriQ0KMt7jfVz3NLZ4yarnc6JLjhk+uMBW5zes y1gsq/YCZ3C945VQgHke5BxDhkDnmL8RKPvVBLznqpIDeuf1V7ZXCulyhuhcnJ33zQuf AFALjDTDjAzqK3dhVu2D5VXwfRHMP6BuUPmgsn6MuYOhZkJFxkPBFQdvlaZiAoqC09OP LkMQ== MIME-Version: 1.0 X-Received: by 10.194.110.138 with SMTP id ia10mr9042856wjb.3.1382732434534; Fri, 25 Oct 2013 13:20:34 -0700 (PDT) Received: by 10.194.250.73 with HTTP; Fri, 25 Oct 2013 13:20:34 -0700 (PDT) In-Reply-To: References: Date: Fri, 25 Oct 2013 13:20:34 -0700 Message-ID: Subject: Re: Cassandra SSTable deletion/load reporting question From: Jasdeep Hundal To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=047d7bf10aa894788e04e9967b03 X-Virus-Checked: Checked by ClamAV on apache.org --047d7bf10aa894788e04e9967b03 Content-Type: text/plain; charset=ISO-8859-1 Thanks Rob. Will checkout the tool you linked to. In our case it's definitely not the tombstones hanging around since we write entire rows at once and the amount of data in a row is far, far greater than the space a tombstone takes. Jasdeep On Fri, Oct 25, 2013 at 1:14 PM, Robert Coli wrote: > On Fri, Oct 25, 2013 at 1:10 PM, Jasdeep Hundal wrote: > >> >> After performing a large set of deletes on our cluster, a few hundred >> gigabytes work (essentially cleaning out nearly all old data), we noticed >> that nodetool reported about the same load as before. >> > > Tombstones are purgeable only after gc_grace_seconds has elapsed, and only > if all SSTables which contain fragments of that row are involved in the > compaction. > > >> With my understanding, running a repair should have triggered compactions >> between SSTable files and reference counting on the subsequent restart of >> cassandra on a node should have cleared the old files, but this did not >> appear to happen. The load did not start going down until we were writing >> to the cluster again. >> > > Repair is unrelated to minor compaction, except in that it creates new > SSTables via streaming, which may trigger minor compaction. > > >> I suspect that there are a few values hanging around in the old tables so >> the references stayed intact, can anyone confirm this? >> > > Stop suspecting and measure with checksstablegarbage : > https://github.com/cloudian/support-tools > > >> What's a bit more important for me is being able to accurately report the >> size of the "active" data set, since nodetool doesn't seem to be useful for >> this. I use counters for reporting some of this, but is there a single >> source of truth for this, especially since counters do occasionally miss >> updates. >> > > In very new versions of Cassandra, there is tracking of and metrics > available for what percentage of data in a SSTable is expired. > > =Rob > > --047d7bf10aa894788e04e9967b03 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Thanks Rob.

Will checkout the tool you linked to. I= n our case it's definitely not the tombstones hanging around since we w= rite entire rows at once and the amount of data in a row is far, far greate= r than the space a tombstone takes.

Jasdeep


On Fri, Oct 25, 2013 at 1:14 PM, Robert Coli <rcoli@eventbrite.com<= /a>> wrote:

After performing a large set of delet= es on our cluster, a few hundred gigabytes work (essentially cleaning out n= early all old data), we noticed that nodetool reported about the same load = as before.

Tombstones are purgeable only = after gc_grace_seconds has elapsed, and only if all SSTables which contain = fragments of that row are involved in the compaction.
=A0
With my understanding, running a repair should have trigge= red compactions between SSTable files and reference counting on the subsequ= ent restart of cassandra on a node should have cleared the old files, but t= his did not appear to happen. The load did not start going down until we we= re writing to the cluster again.

Repair is unrelated to minor compact= ion, except in that it creates new SSTables via streaming, which may trigge= r minor compaction.
=A0
I suspect that there are a few values hanging around = in the old tables so the references stayed intact, can anyone confirm this?=

Stop suspecting and = measure with checksstablegarbage :=A0https://github.com/cloudian/support-tools=
=A0
What's a bit more = important for me is being able to accurately report the size of the "a= ctive" data set, since nodetool doesn't seem to be useful for this= . I use counters for reporting some of this, but is there a single source o= f truth for this, especially since counters do occasionally miss updates.

In very new versions of = Cassandra, there is tracking of and metrics available for what percentage o= f data in a SSTable is expired.

=3DRob
=A0

--047d7bf10aa894788e04e9967b03--