Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 03BEA105AE for ; Thu, 11 Jul 2013 23:11:49 +0000 (UTC) Received: (qmail 84103 invoked by uid 500); 11 Jul 2013 23:11:48 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 84082 invoked by uid 500); 11 Jul 2013 23:11:48 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 84073 invoked by uid 99); 11 Jul 2013 23:11:48 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Jul 2013 23:11:48 +0000 Date: Thu, 11 Jul 2013 23:11:48 +0000 (UTC) From: "Nick Bailey (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-5722) Cleanup should skip sstables that don't contain data outside a nodes ranges MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13706414#comment-13706414 ] Nick Bailey commented on CASSANDRA-5722: ---------------------------------------- After thinking about this a bit more, this is going to be pretty useless with vnodes isn't it? Each sstable is going to contain keys from all over the ring so this probably won't be able to skip any. I guess there are enough people not on vnodes it could be worth it still though. > Cleanup should skip sstables that don't contain data outside a nodes ranges > --------------------------------------------------------------------------- > > Key: CASSANDRA-5722 > URL: https://issues.apache.org/jira/browse/CASSANDRA-5722 > Project: Cassandra > Issue Type: Improvement > Reporter: Nick Bailey > Assignee: Tyler Hobbs > Fix For: 2.0.1 > > > Right now cleanup is optimized to simply delete sstables that *only* contain data that doesn't belong on the node, for all other sstables though, it will read them, check each row, and write out new sstables. > Cleanup could be optimized to look at an sstable and determine that all data within the sstable does belong on a node, and therefore skip re-writing that sstable. This would make cleanup essentially a noop in the case where all data on a node belongs on that node. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira