Return-Path:
X-Original-To: apmail-cassandra-commits-archive@www.apache.org
Delivered-To: apmail-cassandra-commits-archive@www.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
	by minotaur.apache.org (Postfix) with SMTP id EFF071875D
	for ; Fri, 29 May 2015 21:42:17 +0000 (UTC)
Received: (qmail 78742 invoked by uid 500); 29 May 2015 21:42:17 -0000
Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org
Received: (qmail 78710 invoked by uid 500); 29 May 2015 21:42:17 -0000
Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@cassandra.apache.org
Delivered-To: mailing list commits@cassandra.apache.org
Received: (qmail 78698 invoked by uid 99); 29 May 2015 21:42:17 -0000
Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28)
	by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 May 2015 21:42:17 +0000
Date: Fri, 29 May 2015 21:42:17 +0000 (UTC)
From: "Albert P Tobey (JIRA)"
To: commits@cassandra.apache.org
Message-ID:
In-Reply-To:
References:
Subject: [jira] [Commented] (CASSANDRA-9517) Switch to DTCS for hint storage
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394


    [ https://issues.apache.org/jira/browse/CASSANDRA-9517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14565469#comment-14565469 ]

Albert P Tobey commented on CASSANDRA-9517:
-------------------------------------------

My original theory was that we could use DTCS for system.hints since it has a timeseries-like table definition and let it delete whole tables when the TTLs expire. That was before I understood exactly how tombstones are used in hints. The patch seemed to help a little in testing, but I did not figure out why it seemed that way.
The forced major compaction is most of the problem when hints build up, so that's the thing that needs to be removed if at all possible. Under 100% write workload on very fast machines I was seeing system.hints compactions in excess of 100GB, which has all kinds of negative side-effects. If there's a way we can convince any of the compaction strategies to split the wide rows across sstables (split by time window) while only merging tombstones along with subsequent cleanup, that could make hints tolerable until 3.0 takes over the world.

> Switch to DTCS for hint storage
> -------------------------------
>
>                 Key: CASSANDRA-9517
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-9517
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Jeremy Hanna
>             Fix For: 2.1.6
>
>
> The DateTieredCompactionStrategy is a good choice for HintedHandoff so that we reduce the compaction load we incur when users build up hints.  [~atobey@datastax.com] and others have tried the following patch in various setups and have seen significantly less load from hint compaction.
> https://gist.github.com/tobert/c069af27e3f8840d137d
> Setting the time window to 10 minutes has shown additional improvement.
> [~krummas] do you have any feedback about this idea and/or settings?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
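
Editor's illustration of the "split by time window" idea raised in the comment: a minimal sketch, not the DTCS implementation and not the linked patch. It assumes each hint can be reduced to a (write_time, ttl, payload) tuple with times in seconds; the names bucket_by_window and droppable_windows are hypothetical. Hints are grouped into fixed 10-minute windows, and a window can be discarded whole once every hint in it has expired, with no merge against other windows.

```python
from collections import defaultdict

# The 10-minute window mentioned in the issue description, in seconds.
WINDOW_SECONDS = 600


def bucket_by_window(hints, window=WINDOW_SECONDS):
    """Group (write_time, ttl, payload) hints by the window their write time falls in."""
    buckets = defaultdict(list)
    for write_time, ttl, payload in hints:
        # Integer division maps every write time to its window id.
        buckets[write_time // window].append((write_time, ttl, payload))
    return dict(buckets)


def droppable_windows(buckets, now):
    """Return the ids of windows in which every hint has expired (write_time + ttl <= now)."""
    return sorted(
        window_id
        for window_id, entries in buckets.items()
        if all(write_time + ttl <= now for write_time, ttl, _ in entries)
    )
```

In this toy model a window is dropped only when all of its hints have passed their TTL, which is why a single long-lived entry keeps the whole window alive. It says nothing about how tombstones in system.hints interact with compaction, which is the complication the comment describes.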