Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 61436200B68 for ; Thu, 4 Aug 2016 18:44:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 5FDE4160A6A; Thu, 4 Aug 2016 16:44:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A2C4A160AAB for ; Thu, 4 Aug 2016 18:44:21 +0200 (CEST) Received: (qmail 79577 invoked by uid 500); 4 Aug 2016 16:44:20 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 79560 invoked by uid 99); 4 Aug 2016 16:44:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Aug 2016 16:44:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 97A872C0D62 for ; Thu, 4 Aug 2016 16:44:20 +0000 (UTC) Date: Thu, 4 Aug 2016 16:44:20 +0000 (UTC) From: "Marcus Eriksson (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-10643) Implement compaction for a specific token range MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 04 Aug 2016 16:44:22 -0000 [ https://issues.apache.org/jira/browse/CASSANDRA-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15408089#comment-15408089 ] Marcus Eriksson commented on CASSANDRA-10643: --------------------------------------------- [~vkasar] the test does not cancel any ongoing compactions - we need to cancel the compactions before picking the sstable instances (when cancelling the compaction we reset the original instances which have not had their starts moved or have been early opened), otherwise we can't mark them as compacting. I'll get it committed tomorrow, thanks! > Implement compaction for a specific token range > ----------------------------------------------- > > Key: CASSANDRA-10643 > URL: https://issues.apache.org/jira/browse/CASSANDRA-10643 > Project: Cassandra > Issue Type: Improvement > Components: Compaction > Reporter: Vishy Kasar > Assignee: Vishy Kasar > Labels: lcs > Attachments: 10643-trunk-REV01.txt, 10643-trunk-REV02.txt, 10643-trunk-REV03.txt > > > We see repeated cases in production (using LCS) where small number of users generate a large number repeated updates or tombstones. Reading data of such users brings in large amounts of data in to java process. Apart from the read itself being slow for the user, the excessive GC affects other users as well. > Our solution so far is to move from LCS to SCS and back. This takes long and is an over kill if the number of outliers is small. For such cases, we can implement the point compaction of a token range. We make the nodetool compact take a starting and ending token range and compact all the SSTables that fall with in that range. We can refuse to compact if the number of sstables is beyond a max_limit. > Example: > nodetool -st 3948291562518219268 -et 3948291562518219269 compact keyspace table -- This message was sent by Atlassian JIRA (v6.3.4#6332)