Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4D0F417667 for ; Fri, 10 Apr 2015 23:31:48 +0000 (UTC) Received: (qmail 28139 invoked by uid 500); 10 Apr 2015 23:31:45 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 28095 invoked by uid 500); 10 Apr 2015 23:31:45 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 28084 invoked by uid 99); 10 Apr 2015 23:31:45 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Apr 2015 23:31:45 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jonathan.haddad@gmail.com designates 209.85.216.46 as permitted sender) Received: from [209.85.216.46] (HELO mail-vn0-f46.google.com) (209.85.216.46) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Apr 2015 23:31:21 +0000 Received: by vnbf1 with SMTP id f1so9478867vnb.5 for ; Fri, 10 Apr 2015 16:30:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=pYDakkHkT6VGn8iWvnO5oyoXvTvzU1uYcI9o7/g0a/A=; b=dNY5d9NQjZhZd+f8XWTwngQnPNLOJIS4fhm1PfBbt0K5A/NYuq8VFEEKWSGjot/enq qcY9ZNp/VX9FN8K0RCLalRMtLyqN9Ay6q7MtG1eOv786TQXMevK8TdbDrQZTNQ2xBCok G9/kVntEbtV2NAVb0MMrrPa3ocr2zkcPDk2Y+dp+7zKc8jN9mOyFMBTYkoooR2SJ6DbJ Wz2C0duRaUPij4mcQgmAjXblR1xmIr+4JLowZmZjwYMsDlhVBzSG0jMEZvKCH5fs1QrJ jgAVt/pMWzLSlefmliw0LfZ+J/v6WnyT8XlyQB2Cp9f01BkHjTBiRqbst0rute35ionV d3DA== MIME-Version: 1.0 X-Received: by 10.60.78.72 with SMTP id z8mr4623966oew.13.1428708634483; Fri, 10 Apr 2015 16:30:34 -0700 (PDT) Sender: jonathan.haddad@gmail.com Received: by 10.202.97.8 with HTTP; Fri, 10 Apr 2015 16:30:34 -0700 (PDT) In-Reply-To: References: Date: Fri, 10 Apr 2015 16:30:34 -0700 X-Google-Sender-Auth: D2aUQcNZByjUkCc8Z1cUNMjfD64 Message-ID: Subject: Re: Moving SSTables from one disk to another From: Jonathan Haddad To: "user@cassandra.apache.org" Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org I had submitted this issue which could have had (in theory) some serious performance benefit when using JBOD: https://issues.apache.org/jira/browse/CASSANDRA-8868 However, it was pointed out to me that https://issues.apache.org/jira/browse/CASSANDRA-6696 will be a better solution in a lot of cases. On Fri, Apr 10, 2015 at 4:13 PM, Robert Coli wrote: > On Fri, Apr 10, 2015 at 4:00 PM, Roman Tkachenko > wrote: >> >> * Can I just move some SSTables data files from "sstables2" to "sstables1" >> which has much more free disk space? Will Cassandra start fine after that >> and not lose any data? > > > Cassandra generally discovers files in its data directories and treats them > as legitimate files. I do not have specific knowledge of JBOD behavior here, > but I would presume it would be the same. > >> >> * Provided multiple data dirs, should Cassandra distribute data equally >> between them? In what I'm observing this is almost always not true. On that >> particular node I mentioned above the difference is huge: 4% occupied disk >> space for "sstables1" and 87% for "sstables2"; on other nodes the situation >> is a little better but still not 50/50. > > > No, and especially not when using Size Tiered Compaction. > > I honestly wonder why people think JBOD is a useful feature for Cassandra. > You don't really want to continue to operate a node that has lost half of > its data, and managing multiple data directories seems relatively likely to > be more trouble than it's worth. You have a distributed, replicated > database... just replace nodes when they fail. Anyone care to set me > straight about the amazing benefits they see which make the costs > worthwhile? > > =Rob > -- Jon Haddad http://www.rustyrazorblade.com twitter: rustyrazorblade