Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 40252 invoked from network); 20 Jun 2010 05:45:40 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Jun 2010 05:45:40 -0000 Received: (qmail 87637 invoked by uid 500); 20 Jun 2010 05:45:39 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 87404 invoked by uid 500); 20 Jun 2010 05:45:36 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 87396 invoked by uid 99); 20 Jun 2010 05:45:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Jun 2010 05:45:35 +0000 X-ASF-Spam-Status: No, hits=1.9 required=10.0 tests=FORGED_YAHOO_RCVD,FREEMAIL_FROM,RCVD_NUMERIC_HELO,SPF_HELO_PASS,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of gcdcu-cassandra-user-1@m.gmane.org designates 80.91.229.12 as permitted sender) Received: from [80.91.229.12] (HELO lo.gmane.org) (80.91.229.12) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Jun 2010 05:45:27 +0000 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1OQDL5-0003fW-6P for user@cassandra.apache.org; Sun, 20 Jun 2010 07:45:03 +0200 Received: from 195.62.31.20 ([195.62.31.20]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sun, 20 Jun 2010 07:45:03 +0200 Received: from amir74tt by 195.62.31.20 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sun, 20 Jun 2010 07:45:03 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: user@cassandra.apache.org connect(): No such file or directory From: Amir Subject: Problem with Deletes Date: Thu, 17 Jun 2010 16:10:52 +0000 (UTC) Lines: 54 Message-ID: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: sea.gmane.org User-Agent: Loom/3.14 (http://gmane.org/) X-Loom-IP: 195.62.31.20 (Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.2; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.5.21022; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; InfoPath.2; AskTB5.4)) Hi All, I'm running a benchmark on Cassandra while using a benchmark client which I've written myself. I'm running the following scenario: One Cassandra node on the same machine as the client. The client writes a new key every 1 second and deletes it after 10 seconds, so at any given time there should be only 10 keys. The key value size is 2K. When I ran this scenario I looked at the data folder and saw that initially cassandra created 4 files (SSTables) of ~130K and then compacted them to 20K (which is exactly what I expected 10 keys * 2K = 20K). And afterwards there where another 3 130K files which were compacted together with the first 20K file to a new 20K file, and so on... This scenario ran exactly as I excepted. Now I ran the same scenario but this time with key value size=2M. When I ran this scenario initially cassandra created 4 files (SSTables) of ~64M and then compacted them to 20M (which is exactly what I expected 10 keys * 2M = 20M). But after creating another 3 64M files - the problem started! It didn't compact them with the first 20M - instead it created another 64M file and compacted all 4 of them to 260M file (!), and after creating another 4 64M files it compacted them to another 260M file and so on... It looks to me like in this scenario for some reason the compaction doesn't make any deletes. I don't have any idea why :-( Additional info which I should mention: In the storage-conf.xml the following are not default: GCGraceSeconds = 0 MemtableFlushAfterMinutes = 1 Thanks a lot for your help, Amir