Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 95129200B5E for ; Wed, 27 Jul 2016 01:33:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 93BF1160AA4; Tue, 26 Jul 2016 23:33:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E24FF160AA6 for ; Wed, 27 Jul 2016 01:33:21 +0200 (CEST) Received: (qmail 18632 invoked by uid 500); 26 Jul 2016 23:33:20 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 18598 invoked by uid 99); 26 Jul 2016 23:33:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jul 2016 23:33:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 918072C0D5F for ; Tue, 26 Jul 2016 23:33:20 +0000 (UTC) Date: Tue, 26 Jul 2016 23:33:20 +0000 (UTC) From: "Brandon Williams (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CASSANDRA-12300) Disallow unset memtable_cleanup_threshold when flush writers is set MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 26 Jul 2016 23:33:22 -0000 [ https://issues.apache.org/jira/browse/CASSANDRA-12300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15394770#comment-15394770 ] Brandon Williams commented on CASSANDRA-12300: ---------------------------------------------- I think part of it is our somewhat bad advice: {noformat} # If your data directories are backed by SSD, you should increase this # to the number of cores. #memtable_flush_writers: 8 {noformat} Oh, I have SSD, I'll set this to the number of cores. In reality, I think what you want to set this to is min(active_tables+fudge, num_cores). Instead when you do just blindly set this to the number of cores, you can get a huge divisor for mct if unset, and create a bunch of very small sstables (I've seen as bad as a handful or two of kilobytes.) > Disallow unset memtable_cleanup_threshold when flush writers is set > ------------------------------------------------------------------- > > Key: CASSANDRA-12300 > URL: https://issues.apache.org/jira/browse/CASSANDRA-12300 > Project: Cassandra > Issue Type: Improvement > Reporter: Brandon Williams > > Many times I see flush writers set, and mct unset, leading to a very small mct, which causes unneeded frequent flushing, and then of course compaction. I also think the default is a bit conservative, typically ending up at 0.11, where I'd say the majority of use cases only have one or two hot tables and are much better served at 0.7 or 0.8. -- This message was sent by Atlassian JIRA (v6.3.4#6332)