Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6CFE719E67 for ; Sat, 12 Mar 2016 22:28:59 +0000 (UTC) Received: (qmail 66830 invoked by uid 500); 12 Mar 2016 22:28:56 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 66786 invoked by uid 500); 12 Mar 2016 22:28:56 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 66776 invoked by uid 99); 12 Mar 2016 22:28:56 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 12 Mar 2016 22:28:56 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id C12A3180315 for ; Sat, 12 Mar 2016 22:28:55 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id 4PtCN6p4C5Ni for ; Sat, 12 Mar 2016 22:28:54 +0000 (UTC) Received: from mail-wm0-f54.google.com (mail-wm0-f54.google.com [74.125.82.54]) by mx2-lw-us.apache.org (ASF Mail Server at mx2-lw-us.apache.org) with ESMTPS id 4AE2A5FAF0 for ; Sat, 12 Mar 2016 22:28:54 +0000 (UTC) Received: by mail-wm0-f54.google.com with SMTP id p65so58631679wmp.1 for ; Sat, 12 Mar 2016 14:28:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to; bh=2e5Sw4U/FUcr0iuz0GX4ac5bILVaVfeE7jaXWsEVUgc=; b=BWklfHYTS32eZr/biDJsL6BgOLrXA8AoDpK+hi4Y4pISSrgfTV/NI/M51rvN/K97to VEXhyQTlXkFYlmWhcLSp9af4XEd7PL4RziIJWXrboDgv98ATgslQBXRTxJqVvKN2LYr7 PErozn5wNMT0sU0FwPh+ijEehmb6WE4toUxUFKEO+DWewbSiaHjFEgV2dSk/71+Je+qq yWD5H9Fy9xTqoIemiA3XE5TZswVmvqOfWhtyH1NOkCnr3AWUwa8kkwnlun1rMm1jUaRo IzZCpMtCJV3uo5qP/W7/86ktmCTc3Ef5yL31GnW6/hwGhJnxF7vSJdwWyyDXz2sVqFpj suSA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to; bh=2e5Sw4U/FUcr0iuz0GX4ac5bILVaVfeE7jaXWsEVUgc=; b=Dh0f+2Q14JqbSgf+RhTJEzMwjerBIaVYagBNk8utIOC46dUf1tluBJJqro/YR/qjes 3xon1hQ+wZJDIrr5kyO3jAKHBknFt0wSQ75of9nHhrhDcmigw8LdRlniJIRh4IodCtzF Db+mhEdwrFY6VddrD+Zv1oWctJ05v2fvgCWEIMpMFtF/QrX3I9egzKgZlQ6d2lSFNuhj RpvOU2rEwPa2GiAJfLSRe1MTI/uc6mFv/qXkSzbFhf+bvfxwqWIX18690DM0fNwz4shj ox6JLUKquzkuAOaM+rYetGgqMpwYUwYL3cK2e4MH0jYhCh2+5ujdq0vVbGc89BiSZo3j zf/w== X-Gm-Message-State: AD7BkJJAQGURg35HoRpPV6NmtkzMFNVFEihsyR+tombID6daXqlvv+IH276qyDlX5u52Wg== X-Received: by 10.194.158.36 with SMTP id wr4mr15951680wjb.134.1457821733132; Sat, 12 Mar 2016 14:28:53 -0800 (PST) Received: from [192.168.1.2] ([93.84.36.227]) by smtp.googlemail.com with ESMTPSA id z127sm8832071wme.5.2016.03.12.14.28.51 for (version=TLSv1/SSLv3 cipher=OTHER); Sat, 12 Mar 2016 14:28:52 -0800 (PST) Subject: Re: Cassandra 3.2.1: Memory leak? To: user@cassandra.apache.org References: From: "ssivikt@gmail.com" Message-ID: <56E49822.6010304@gmail.com> Date: Sun, 13 Mar 2016 01:28:50 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.5.1 MIME-Version: 1.0 In-Reply-To: Content-Type: multipart/alternative; boundary="------------040709060008070809030005" This is a multi-part message in MIME format. --------------040709060008070809030005 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Hi, I'll duplicate here my email with the same issue "/I have 7 nodes of C* v2.2.5 running on CentOS 7 and using jemalloc for dynamic storage allocation. Use only one keyspace and one table with Leveled compaction strategy. I've loaded ~500 GB of data into the cluster with replication factor equals to 3 and waiting until compaction is finished. But during compaction each of the C* nodes allocates all the available memory (~128GB) and just stops its process. This is a known bug ? /" On 03/13/2016 12:56 AM, Mohamed Lrhazi wrote: > Hello, > > We installed Datastax community edition, on 8 nodes, RHEL7. We > inserted some 7 billion rows into a pretty simple table. the inserts > seem to have completed without issues. but ever since, we find that > the nodes reliably run out of RAM after few hours, without any user > activity at all. No reads nor write are sent at all. What should we > look for to try and identify root cause? > > > [root@avesterra-prod-1 ~]# cat /etc/redhat-release > Red Hat Enterprise Linux Server release 7.2 (Maipo) > [root@avesterra-prod-1 ~]# rpm -qa| grep datastax > datastax-ddc-3.2.1-1.noarch > datastax-ddc-tools-3.2.1-1.noarch > [root@avesterra-prod-1 ~]# > > The nodes had 8 GB RAM, which we doubled twice and now are trying with > 40GB... they still manage to consume it all and cause oom_killer to > kick in. > > Pretty much all the settings are the default ones the installation > created. > > Thanks, > Mohamed. -- Thanks, Serj --------------040709060008070809030005 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit Hi, I'll duplicate here my email with the same issue

"I have 7 nodes of C* v2.2.5 running on CentOS 7 and using jemalloc for dynamic storage allocation.
Use only one keyspace and one table with Leveled compaction strategy.
I've loaded ~500 GB of data into the cluster with replication factor equals to 3 and waiting until compaction is finished. But during compaction each of the C* nodes allocates all the available memory (~128GB) and just stops its process.
This is a known bug ?
"

On 03/13/2016 12:56 AM, Mohamed Lrhazi wrote:
Hello,

We installed Datastax community edition, on 8 nodes, RHEL7. We inserted some 7 billion rows into a pretty simple table. the inserts seem to have completed without issues. but ever since, we find that the nodes reliably run out of RAM after few hours, without any user activity at all. No reads nor write are sent at all.  What should we look for to try and identify root cause?


[root@avesterra-prod-1 ~]# cat /etc/redhat-release 
Red Hat Enterprise Linux Server release 7.2 (Maipo)
[root@avesterra-prod-1 ~]# rpm -qa| grep datastax
datastax-ddc-3.2.1-1.noarch
datastax-ddc-tools-3.2.1-1.noarch
[root@avesterra-prod-1 ~]# 

The nodes had 8 GB RAM, which we doubled twice and now are trying with 40GB... they still manage to consume it all and cause oom_killer to kick in.

Pretty much all the settings are the default ones the installation created.

Thanks,
Mohamed.

-- 
Thanks,
Serj
--------------040709060008070809030005--