Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 87696 invoked from network); 10 Apr 2010 19:25:16 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 10 Apr 2010 19:25:16 -0000 Received: (qmail 83999 invoked by uid 500); 10 Apr 2010 19:25:15 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 83980 invoked by uid 500); 10 Apr 2010 19:25:15 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 83972 invoked by uid 99); 10 Apr 2010 19:25:15 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Apr 2010 19:25:15 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [131.215.239.119] (HELO mail.alumni.caltech.edu) (131.215.239.119) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Apr 2010 19:25:07 +0000 Received: from localhost (dsl081-082-089.lax1.dsl.speakeasy.net [64.81.82.89]) by mail.alumni.caltech.edu (Postfix) with ESMTPSA id 76E053F038A; Sat, 10 Apr 2010 12:24:42 -0700 (PDT) X-DKIM: Sendmail DKIM Filter v2.8.2 mail.alumni.caltech.edu 76E053F038A DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=alumni.caltech.edu; s=enforce; t=1270927482; bh=GzM6SgXI1Xqp2Ppcp3ctqcsCrWkE0fbgN0g+PiPeHtE=; h=Date:From:To:Subject:Message-ID:Mime-Version:Content-Type; b=N2vbI2xdUcSUyb6H3VnucoO1ofwVZmhdEM43Sj6qFtw7pt3638IvbKe220XwbydHP hK35XatlF/D2WdFNhk/ztAjvujA0tm/Yce6ZgcfFjRBaj4hr8HQqSRCDTUO4p+OiD3 5hS3QMskPgP7wEXn5rKt7eAagd7Se6P2b7bKRX4A= Date: Sat, 10 Apr 2010 12:24:38 -0700 From: Anthony Molinaro To: user@cassandra.apache.org Subject: Recovery from botched compaction Message-ID: <20100410192438.GA82910@alumni.caltech.edu> Mail-Followup-To: user@cassandra.apache.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.4.2.3i X-MailScanner-Information-Alumni: X-Alumni-MailScanner-ID: 76E053F038A.AB4B1 X-MailScanner-Alumni: No Virii found X-Spam-Status-Alumni: not spam, SpamAssassin (not cached, score=-3.268, required 5, ALL_TRUSTED -1.80, BAYES_00 -2.60, DNS_FROM_OPENWHOIS 1.13, FH_DATE_PAST_20XX 0.00) X-MailScanner-From: anthonym@alumni.caltech.edu X-Virus-Checked: Checked by ClamAV on apache.org Hi, This is sort of a pre-emptive question as the compaction I'm doing hasn't failed yet but I expect it to any time now. I have a cluster which has been storing user profile data for a client. Recently I've had to go back and reload all the data again. I wasn't watching diskspace, and on one of the nodes it went above 50% (which I recall was bad), to somewhere around 70%. I expect to most back with a compaction (as most of the data was the same so a compaction should remove old copies), and went ahead and started one with nodeprobe compact (using 0.5.0 on this cluster). However, I do see that the disk usage is growing (it's at 91% now). So when the disk fills up and this compaction crashes what can I do? I assume get a bigger disk, shut down the node, move the data and restart will work, but do I have other options? Which files can I ignore (ie, can I not move any of the *-tmp-* files)? Will my system be in a corrupt state? This machine is one in a set of 6, and since I didn't choose tokens initially, they are very lopsided (ie, some use 20% of their disk, others 60-70%). If I were to start moving tokens around would the machines short of space be able to anti-compact without filling up? or does anti-compaction like compaction require 2x disk space? Thanks, -Anthony -- ------------------------------------------------------------------------ Anthony Molinaro