Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E8284200B67 for ; Tue, 2 Aug 2016 02:42:21 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id E5299160AA7; Tue, 2 Aug 2016 00:42:21 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5FD23160A6C for ; Tue, 2 Aug 2016 02:42:21 +0200 (CEST) Received: (qmail 30530 invoked by uid 500); 2 Aug 2016 00:42:20 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 30517 invoked by uid 99); 2 Aug 2016 00:42:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Aug 2016 00:42:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 6B0F02C0059 for ; Tue, 2 Aug 2016 00:42:20 +0000 (UTC) Date: Tue, 2 Aug 2016 00:42:20 +0000 (UTC) From: "Ariel Weisberg (JIRA)" To: commits@cassandra.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (CASSANDRA-12358) Slow PostFlush execution due to 2i flushing can cause near OOM to OOM MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 02 Aug 2016 00:42:22 -0000 Ariel Weisberg created CASSANDRA-12358: ------------------------------------------ Summary: Slow PostFlush execution due to 2i flushing can cause near OOM to OOM Key: CASSANDRA-12358 URL: https://issues.apache.org/jira/browse/CASSANDRA-12358 Project: Cassandra Issue Type: Bug Components: Core Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 3.10 2i can be slow to flush for a variety of reasons. Potentially slower than the rate at which Memtables can ingest and flush data. If this occurs the heap fills up with Memtables that are waiting for PostFlush to run. This occurs because reclaiming the memory is done before PostFlush runs. I will post a branch that has the reclaim memory task run after PostFlush has completed. As far as I can tell this is safe and correct since the memory is committed up until that point. It's not clear to me if PostFlush has to bind the Memtables or not. I suspect it does, but I'm not sure if that is a route I should go down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)