From: Ajeet Grewal
To: user@cassandra.apache.org
Date: Fri, 3 Feb 2012 17:27:48 -0800 (PST)
Subject: Cassandra OOM - 1.0.2

Hey guys,

I am getting an out of memory (mmap failed) error with Cassandra 1.0.2. The relevant log lines are pasted at http://pastebin.com/UM28ZC1g.

Cassandra works fine until it reaches about 300-400GB of load on a single instance (I have 12 nodes, RF=2). Then nodes start failing with these errors. The nodes are pretty beefy: 32GB of RAM and 8 cores each. Increasing the JVM heap size does not help.

What I have checked so far:

- I am running on a 64-bit JVM.
- I am using JNA.
- I have memlock set to unlimited for the user. (I confirmed this by looking at /proc/<pid>/limits.)
- I also tried restarting the process as root, but it crashes with the same error.
- The data directory holds only about 300 files, so it should not be exceeding the open files limit.

I don't know if this is relevant, but I have just two column families, counter_object and counter_time. I am using very wide columns, so row sizes can be huge. You can see from the log link that the *.db files are sometimes pretty big.

Please help!

Thank you!

--
Regards,
Ajeet
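
For anyone who wants to repeat the limit checks described above, here is a minimal sketch, assuming a Linux host where /proc/<pid>/limits is readable. The names `parse_limits` and `LIMITS_OF_INTEREST` are hypothetical helpers made up for this illustration, not part of Cassandra. "Max address space" is included as an assumption because mmap consumes address space; it is not something the message itself mentions.

```python
import os
import re
import sys

# Limits the message talks about (memlock, open files), plus address space,
# which is an assumption added for this sketch.
LIMITS_OF_INTEREST = ("Max locked memory", "Max open files", "Max address space")


def parse_limits(text):
    """Parse the text of a /proc/<pid>/limits file into {name: (soft, hard)}."""
    limits = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the "Limit  Soft Limit  Hard Limit  Units" header.
        if not line or line.startswith("Limit"):
            continue
        # Columns are separated by runs of two or more spaces.
        fields = re.split(r"\s{2,}", line)
        if len(fields) >= 3:
            limits[fields[0]] = (fields[1], fields[2])
    return limits


if __name__ == "__main__":
    # Pass the pid of the Cassandra JVM; defaults to this script's own pid.
    pid = sys.argv[1] if len(sys.argv) > 1 else str(os.getpid())
    path = "/proc/%s/limits" % pid
    if os.path.exists(path):
        with open(path) as fh:
            parsed = parse_limits(fh.read())
        for name in LIMITS_OF_INTEREST:
            soft, hard = parsed.get(name, ("?", "?"))
            print("%-20s soft=%s hard=%s" % (name, soft, hard))
    else:
        print("no %s on this system" % path)
```

Run it with the pid of the running Cassandra process as the only argument. It only reports the soft and hard values; it does not diagnose the cause of the mmap failure.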