Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 1DC2F17425 for ; Wed, 1 Apr 2015 10:26:09 +0000 (UTC) Received: (qmail 77446 invoked by uid 500); 1 Apr 2015 10:25:53 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 77406 invoked by uid 500); 1 Apr 2015 10:25:53 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 77393 invoked by uid 99); 1 Apr 2015 10:25:53 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Apr 2015 10:25:53 +0000 Date: Wed, 1 Apr 2015 10:25:53 +0000 (UTC) From: "Marcus Eriksson (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Resolved] (CASSANDRA-8360) In DTCS, always compact SSTables in the same time window, even if they are fewer than min_threshold MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-8360?page=3Dcom.atla= ssian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcus Eriksson resolved CASSANDRA-8360. ---------------------------------------- Resolution: Fixed will commit fix as part of CASSANDRA-8359 > In DTCS, always compact SSTables in the same time window, even if they ar= e fewer than min_threshold > -------------------------------------------------------------------------= -------------------------- > > Key: CASSANDRA-8360 > URL: https://issues.apache.org/jira/browse/CASSANDRA-8360 > Project: Cassandra > Issue Type: Improvement > Reporter: Bj=C3=B6rn Hegerfors > Assignee: Bj=C3=B6rn Hegerfors > Priority: Minor > Fix For: 3.0, 2.1.4, 2.0.14 > > Attachments: cassandra-2.0-CASSANDRA-8360.txt > > > DTCS uses min_threshold to decide how many time windows of the same size = that need to accumulate before merging into a larger window. The age of an = SSTable is determined as its min timestamp, and it always falls into exactl= y one of the time windows. If multiple SSTables fall into the same window, = DTCS considers compacting them, but if they are fewer than min_threshold, i= t decides not to do it. > When do more than 1 but fewer than min_threshold SSTables end up in the s= ame time window (except for the current window), you might ask? In the curr= ent state, DTCS can spill some extra SSTables into bigger windows when the = previous window wasn't fully compacted, which happens all the time when the= latest window stops being the current one. Also, repairs and hints can put= new SSTables in old windows. > I think, and [~jjordan] agreed in a comment on CASSANDRA-6602, that DTCS = should ignore min_threshold and compact tables in the same windows regardle= ss of how few they are. I guess max_threshold should still be respected. > [~jjordan] suggested that this should apply to all windows but the curren= t window, where all the new SSTables end up. That could make sense. I'm not= clear on whether compacting many SSTables at once is more cost efficient o= r not, when it comes to the very newest and smallest SSTables. Maybe compac= ting as soon as 2 SSTables are seen is fine if the initial window size is s= mall enough? I guess the opposite could be the case too; that the very newe= st SSTables should be compacted very many at a time? -- This message was sent by Atlassian JIRA (v6.3.4#6332)