Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 0225E6C94 for ; Wed, 22 Jun 2011 16:36:32 +0000 (UTC) Received: (qmail 32463 invoked by uid 500); 22 Jun 2011 16:36:29 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 32441 invoked by uid 500); 22 Jun 2011 16:36:29 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 32433 invoked by uid 99); 22 Jun 2011 16:36:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Jun 2011 16:36:29 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jonathan.colby@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Jun 2011 16:36:21 +0000 Received: by wwe5 with SMTP id 5so896809wwe.25 for ; Wed, 22 Jun 2011 09:36:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:from:content-type:content-transfer-encoding :subject:date:message-id:to:mime-version:x-mailer; bh=YuCJJilnWKqZ4+yLBExtICnY1PxUyMGukZ9M160SfzA=; b=ckaeeRnnrhEw4snRMu9uthCl3LS69CF2cJvlMp4YFWDQ0b3nZ1BBjgXtql0tvakuRT mLgWyFLCdTFXW3REYvNSUYIzhADmojkynyhjlgl2Qy91a58jegFbQKGw48fS1mvh4d7l F4lHfWuzNGhXXJf9dSyGVh/Lq7TTbww6XjKzw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:content-type:content-transfer-encoding:subject:date:message-id :to:mime-version:x-mailer; b=OcyoHqLKoHfUNyfcuSWfZ/dsA2NnA8O/xu0yp+72LZkeOea7EN38noPbqXDbEH4z/3 6YG3QZowWgzpxJeLD6uRkmPEfIZDLfMUrbRT/Xf3FtZW5AgI/vk1CKdcvBErAh6ecU3q EpkcZpfYGWnVJkHMsHa9Q01K+x5qqiSobmPYc= Received: by 10.227.12.15 with SMTP id v15mr937054wbv.77.1308760560483; Wed, 22 Jun 2011 09:36:00 -0700 (PDT) Received: from [10.250.128.161] ([195.234.136.12]) by mx.google.com with ESMTPS id eq4sm572940wbb.54.2011.06.22.09.35.58 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 22 Jun 2011 09:35:59 -0700 (PDT) From: Jonathan Colby Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: simple question about merged SSTable sizes Date: Wed, 22 Jun 2011 18:35:57 +0200 Message-Id: To: user@cassandra.apache.org Mime-Version: 1.0 (Apple Message framework v1084) X-Mailer: Apple Mail (2.1084) The way compaction works, "x" same-sized files are merged into a new = SSTable. This repeats itself and the SSTable get bigger and bigger. So what is the upper limit?? If you are not deleting stuff fast = enough, wouldn't the SSTable sizes grow indefinitely? I ask because we have some rather large SSTable files (80-100 GB) and = I'm starting to worry about future compactions. Second, compacting such large files is an IO killer. What can be = tuned other than compaction_threshold to help optimize this and prevent = the files from getting too big? Thanks!=