Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 73948 invoked from network); 31 May 2010 13:12:14 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 31 May 2010 13:12:14 -0000 Received: (qmail 19646 invoked by uid 500); 31 May 2010 13:12:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 19624 invoked by uid 500); 31 May 2010 13:12:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 19616 invoked by uid 99); 31 May 2010 13:12:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 31 May 2010 13:12:13 +0000 X-ASF-Spam-Status: No, hits=-0.3 required=10.0 tests=AWL,FREEMAIL_FROM,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of maximkr@gmail.com designates 209.85.161.44 as permitted sender) Received: from [209.85.161.44] (HELO mail-fx0-f44.google.com) (209.85.161.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 31 May 2010 13:12:07 +0000 Received: by fxm6 with SMTP id 6so2759904fxm.31 for ; Mon, 31 May 2010 06:11:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:sender:message-id:date:from :organization:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=31MkdDZf2GxT3NIsZUxDdZH9Ep3QltVDJNY6zFEVM7M=; b=kSecpOynWzZLJC83EnpBctISx2Rh++o2+xbUX5FhlZ9iQN1BVDkiJ6UYSq7PIMcOPy nKyZpMz/wI64tyHaExkZke/Nchu9liWW7eKgPl2UIvosHfSneRB+U7JHsxdB36WFiVyQ i44t5ap3+7bv1OVlEnmcYvx7IGQdCKmyYecms= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:message-id:date:from:organization:user-agent:mime-version:to :subject:references:in-reply-to:content-type :content-transfer-encoding; b=cwGfCTQgpB8aqpO+heIM4CWoeZ1KTUz/j4PYlgwP1r8AYC1X0+pH+7p2vsEy7jUTTM NiVF/rBBzaNE6HiVbFVvkwyE/3e7PueWfA2dYAaGauULmCmcwuTW/4n7/GWm2nS7jfZu on4PXfhFHVis9/+zok+lCzbvWjst/1q58w5k4= Received: by 10.223.60.3 with SMTP id n3mr5417522fah.83.1275311505227; Mon, 31 May 2010 06:11:45 -0700 (PDT) Received: from [192.168.1.5] (95-31-251-70.broadband.corbina.ru [95.31.251.70]) by mx.google.com with ESMTPS id 2sm40189898faf.3.2010.05.31.06.11.44 (version=TLSv1/SSLv3 cipher=RC4-MD5); Mon, 31 May 2010 06:11:44 -0700 (PDT) Sender: Maxim Kramarenko Message-ID: <4C03B5BA.7040601@trackstudio.com> Date: Mon, 31 May 2010 17:12:26 +0400 From: Maxim Kramarenko Organization: TrackStudio User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.9) Gecko/20100330 Shredder/3.0.4 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: nodetool cleanup isn't cleaning up? References: <4C03ABB9.6050007@trackstudio.com> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Hello! I think (but not sure, please correct me if required), that after you change token, nodes just receive new data, but don't immediate deletes old one. It seems like "clean" will mark them as tombstone and it will be deleted when you run "compact" after GCGraceSeconds seconds. On 31.05.2010 17:00, Ran Tavory wrote: > Do you think it's the tombstones that take up the disk space? > Shouldn't the tombstones be moved along with the data? > > On Mon, May 31, 2010 at 3:29 PM, Maxim Kramarenko > > wrote: > > Hello! > > You likely need wait for GCGraceSeconds seconds or modify this param. > > http://spyced.blogspot.com/2010/02/distributed-deletes-in-cassandra.html > === > Thus, a delete operation can't just wipe out all traces of the data > being removed immediately: if we did, and a replica did not receive > the delete operation, when it becomes available again it will treat > the replicas that did receive the delete as having missed a write > update, and repair them! So, instead of wiping out data on delete, > Cassandra replaces it with a special value called a tombstone. The > tombstone can then be propagated to replicas that missed the initial > remove request. > ... > Here, we defined a constant, GCGraceSeconds, and had each node track > tombstone age locally. Once it has aged past the constant, it can be > GC'd. > ===