Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8124C6A19 for ; Tue, 7 Jun 2011 15:15:25 +0000 (UTC) Received: (qmail 97145 invoked by uid 500); 7 Jun 2011 15:15:23 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 97040 invoked by uid 500); 7 Jun 2011 15:15:23 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 97029 invoked by uid 99); 7 Jun 2011 15:15:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Jun 2011 15:15:23 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [204.13.248.74] (HELO mho-02-ewr.mailhop.org) (204.13.248.74) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Jun 2011 15:15:15 +0000 Received: from 71-218-75-109.hlrn.qwest.net ([71.218.75.109] helo=[192.168.0.2]) by mho-02-ewr.mailhop.org with esmtpsa (TLSv1:CAMELLIA256-SHA:256) (Exim 4.72) (envelope-from ) id 1QTxza-000Df1-3W for user@cassandra.apache.org; Tue, 07 Jun 2011 15:14:54 +0000 X-Mail-Handler: MailHop Outbound by DynDNS X-Originating-IP: 71.218.75.109 X-Report-Abuse-To: abuse@dyndns.com (see http://www.dyndns.com/services/mailhop/outbound_abuse.html for abuse reporting information) X-MHO-User: U2FsdGVkX19i/D6W/a1dXyfT4hT1xWE2GQLyN2740ks= Message-ID: <4DEE4068.9090802@dude.podzone.net> Date: Tue, 07 Jun 2011 09:14:48 -0600 From: AJ User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.2.17) Gecko/20110414 Lightning/1.0b2 Thunderbird/3.1.10 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: Backups, Snapshots, SSTable Data Files, Compaction References: <4DEDB132.4070100@dude.podzone.net> <4DEDB65C.203@datastax.com> <4DEDD135.6090800@dude.podzone.net> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 6/7/2011 2:29 AM, Maki Watanabe wrote: > You can find useful information in: > http://www.datastax.com/docs/0.8/operations/scheduled_tasks > > sstables are immutable. Once it written to disk, it won't be updated. > When you take snapshot, the tool makes hard links to sstable files. > After certain time, you will have some times of memtable flushs, so > your sstable files will be merged, and obsolete sstable files will be > removed. But snapshot set will remains on your disk, for backup. > Thanks for the doc source. I will be experimenting with 0.8.0 since it has many features I've been waiting for. But, still, if the snapshots don't link to all of the previous sets of .db files, then those unlinked previous file sets MUST be safe to manually delete. But, they aren't deleted until later after a GC. It's a bit confusing why they are kept after compaction up until GC when they seem to not be needed. We have Big Data plans... one node can have 10's of TBs, so I'm trying to get an idea of how much disk space will be required and whether or not I can free-up some disk space. Hopefully someone can still elaborate on this.