Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 21493 invoked from network); 20 Sep 2010 21:58:23 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Sep 2010 21:58:23 -0000 Received: (qmail 41011 invoked by uid 500); 20 Sep 2010 21:58:21 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 40955 invoked by uid 500); 20 Sep 2010 21:58:21 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 40947 invoked by uid 99); 20 Sep 2010 21:58:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Sep 2010 21:58:20 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [209.85.213.44] (HELO mail-yw0-f44.google.com) (209.85.213.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Sep 2010 21:58:13 +0000 Received: by ywk9 with SMTP id 9so2028895ywk.31 for ; Mon, 20 Sep 2010 14:57:49 -0700 (PDT) MIME-Version: 1.0 Received: by 10.150.139.9 with SMTP id m9mr10111837ybd.442.1285019869455; Mon, 20 Sep 2010 14:57:49 -0700 (PDT) Sender: scode@scode.org Received: by 10.151.101.3 with HTTP; Mon, 20 Sep 2010 14:57:49 -0700 (PDT) X-Originating-IP: [213.114.156.79] In-Reply-To: References: Date: Mon, 20 Sep 2010 23:57:49 +0200 X-Google-Sender-Auth: OoIMUv9Ztk5XaUOH5SS3hg4LuJU Message-ID: Subject: Re: what are ways to keep the SSTable Count down low From: Peter Schuller To: user@cassandra.apache.org Content-Type: text/plain; charset=UTF-8 > Not in 0.6 I believe, but I believe this recently went into trunk for > the upcoming 0.7. By "not in 0.6" i mean in terms of the configuration. If you are willing to modify the source you can change it fairly easily by changing the private constants towards the beginning of org/apache/cassandra/db/CompactionManager.java: private int minimumCompactionThreshold = 4; // compact this many sstables min at a time private int maximumCompactionThreshold = 32; // compact this many sstables max at a time Note that increasing the min will tend to mean more overall compaction, while decreasing it means less overall compaction. The trade-off is potentially slower reads due to more sstables having to be consulted (meaning, more seeking). If you're worried about decreasing amount of compaction, you don't want to decrease the 'max'. It *might*, under some circumstances, contribute to decreasing the impact of compaction once it happens, but that would be highly dependent on I/O scheduling and other circumstances. I wouldn't touch if unless I knew specifically there was a reason to. -- / Peter Schuller