Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 64226 invoked from network); 4 Dec 2009 02:09:50 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Dec 2009 02:09:50 -0000 Received: (qmail 23463 invoked by uid 500); 4 Dec 2009 02:09:49 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 23412 invoked by uid 500); 4 Dec 2009 02:09:49 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 23403 invoked by uid 99); 4 Dec 2009 02:09:49 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Dec 2009 02:09:49 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbellis@gmail.com designates 209.85.219.220 as permitted sender) Received: from [209.85.219.220] (HELO mail-ew0-f220.google.com) (209.85.219.220) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Dec 2009 02:09:41 +0000 Received: by ewy20 with SMTP id 20so162359ewy.0 for ; Thu, 03 Dec 2009 18:09:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :from:date:message-id:subject:to:content-type; bh=nJ8vGGG0aPcimQGPLM1EbGv/H7r5JBpbjm1iBvci8FA=; b=rgCcNOBn7Z57GZLFQmnAGx83Mo/9HDlIL+TXAi0iW8Ojvf2Kg0K5NSPFUtWtFhOIg6 DUhUXcKdcogwBxesIO8Yyp+sKNpjp0fhQSpH3UAl+aM6Z7NchCW0eSWYjBzRoIsMQWe6 LS9A8Q2s0JEDxRhtQv/B2TBTh8+pa31fe1S4I= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=vdRCnFuUhBBsrPpMfwc/s/4NA11O3cPqjHTRjNhq7r7FgzdkyOWfLMu61P2cHKlChD ONTVr9G2wDjL+oIe1zw8gNuVd4IUyphmtqYKVMf9C1D64NdQVNUSuXpGlypQjcD1+UhR CzNJe8AvEUhy14EIJKR2Zq8VikLdr12TJ762c= MIME-Version: 1.0 Received: by 10.216.93.78 with SMTP id k56mr810982wef.102.1259892561191; Thu, 03 Dec 2009 18:09:21 -0800 (PST) In-Reply-To: References: From: Jonathan Ellis Date: Thu, 3 Dec 2009 20:09:01 -0600 Message-ID: Subject: Re: Removes increasing disk space usage in Cassandra? To: cassandra-user@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org cassandra never modifies data in-place. so it writes tombstones to supress the older writes, and when compaction occurs the data and tombstones get GC'd (after the period specified in your config file). On Thu, Dec 3, 2009 at 8:07 PM, Ramzi Rabah wrote: > Looking at jconsole I see a high number of writes when I do removes, > so I am guessing these are tombstones being written? If that's the > case, is the data being removed and replaced by tombstones? and will > they all be deleted eventually when compaction runs? > > > > On Thu, Dec 3, 2009 at 3:18 PM, Ramzi Rabah wrote: >> Hi all, >> >> I ran a test where I inserted about 1.2 Gigabytes worth of data into >> each node of a 4 node cluster. >> I ran a script that first calls a get on each column inserted followed >> by a remove. Since I was basically removing every entry >> I inserted before, I expected that the disk space occupied by the >> nodes will go down and eventually become 0. The disk space >> actually goes up when I do the bulk removes to about 1.8 gigs per >> node. Am I missing something here? >> >> Thanks a lot for your help >> Ray >> >