Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9295A10589 for ; Fri, 1 Nov 2013 19:48:32 +0000 (UTC) Received: (qmail 71122 invoked by uid 500); 1 Nov 2013 19:48:30 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 71066 invoked by uid 500); 1 Nov 2013 19:48:30 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 71058 invoked by uid 99); 1 Nov 2013 19:48:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Nov 2013 19:48:29 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of horky@avast.com designates 74.125.83.41 as permitted sender) Received: from [74.125.83.41] (HELO mail-ee0-f41.google.com) (74.125.83.41) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Nov 2013 19:48:23 +0000 Received: by mail-ee0-f41.google.com with SMTP id e53so2219759eek.28 for ; Fri, 01 Nov 2013 12:48:02 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:message-id:date:from:user-agent:mime-version:to :cc:subject:content-type:content-transfer-encoding; bh=9Lkvq67+Fy0UL/m70aa5rgViiFcqSvap0E5XZdt0YkQ=; b=OylW9NhxzabsU2L7gnbAX+c+qAOmS/l0YdV4Gvf5rnF49EOp7VAL3ZhHkPgrJ//dBM rRfi0ZJeMLWnT4UbJsfIMBc8z2WL6aBGrpQYbhVLZ94VobakR9/yj2ZoNf4ahHoaLYyi uYNW+il+KZDZtZx5U2LbEwGxBwc+682OT1R8sb1YskSOdGO5OHFbb9vzn57Igns1591k /DLQbbGHDvqoKYhEtlWeshSI/NSRz8QeeqvQfmjJKijnDR0r2Q9P6COwRTsfUl0EG58x t7pDeJOj2aRzdPlIxvK76ZBzH0wNzwPTkvbKloYZnHEActjlvbqpC7c9cpYdO0oyMpv8 6iTA== X-Gm-Message-State: ALoCoQkxqgpMnrVEOwssoGYuZvLTXwrEX9vz7AfQwshdDckZyRr29jTHVMWeB+S8AFyXv2S9JqxB X-Received: by 10.15.45.135 with SMTP id b7mr669538eew.135.1383335282647; Fri, 01 Nov 2013 12:48:02 -0700 (PDT) Received: from [172.31.1.101] (ip-62-245-69-12.net.upcbroadband.cz. [62.245.69.12]) by mx.google.com with ESMTPSA id m54sm11527544eex.2.2013.11.01.12.48.01 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 01 Nov 2013 12:48:02 -0700 (PDT) Message-ID: <5274056C.4010401@avast.com> Date: Fri, 01 Nov 2013 20:47:56 +0100 From: Jiri Horky User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130827 Thunderbird/17.0.8 MIME-Version: 1.0 To: user@cassandra.apache.org CC: =?UTF-8?B?J1pkZW7Em2sgT3R0Jw==?= Subject: Recompacting all sstables Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi all since we upgraded half of our Cassandra cluster to 2.0.0 and we use LCS, we hit CASSANDRA-6284 bug. So basically all data in sstables created after the upgrade are wrongly (non-uniformly within compaction levels) distributed. This causes a huge overhead when compacting new sstables (see the bug for the details). After applying the patch, the distribution of the data within a level is supposed to recover itself over time but we would like to not to wait a month or so until it gets better. So the question. What is the best way to recompact all the sstables so the data in one sstables within a level would contain more or less the right portion of the data, in other worlds, keys would be uniformly distributed across sstables within a level? (e.g.: assumming total token range for a node 1..10000, and given that L2 should contain 100 sstables, , all sstables within L2 should cover a range of ~100 tokens). Based on documentation, I can only think of switching to SizeTiered compaction, doing major compaction and then switching back to LCS. Thanks in advance Jiri Horky