Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 91D2481AE for ; Thu, 1 Sep 2011 23:55:35 +0000 (UTC) Received: (qmail 9882 invoked by uid 500); 1 Sep 2011 23:55:33 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 9651 invoked by uid 500); 1 Sep 2011 23:55:32 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 9641 invoked by uid 99); 1 Sep 2011 23:55:32 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Sep 2011 23:55:32 +0000 X-ASF-Spam-Status: No, hits=1.3 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_NEUTRAL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [74.125.82.172] (HELO mail-wy0-f172.google.com) (74.125.82.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Sep 2011 23:55:27 +0000 Received: by wyg8 with SMTP id 8so2082655wyg.31 for ; Thu, 01 Sep 2011 16:55:02 -0700 (PDT) MIME-Version: 1.0 Received: by 10.227.60.140 with SMTP id p12mr432670wbh.30.1314921296236; Thu, 01 Sep 2011 16:54:56 -0700 (PDT) Received: by 10.227.55.72 with HTTP; Thu, 1 Sep 2011 16:54:56 -0700 (PDT) X-Originating-IP: [173.167.104.65] Date: Thu, 1 Sep 2011 16:54:56 -0700 Message-ID: Subject: Fun with Heap Dump ... From: Ian Danforth To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 All, I need help interpreting the results of my investigation. I'm encountering this error: "Unable to reduce heap usage since there are no dirty column families". My heap sits near max and occasionally OOMs. (4GB heap) Following Mr. Ellis's instructions here: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Memory-Usage-During-Read-td6338635.html I set the heap down to 1GB, restarted the node, watched the memory climb in jconsole and waited for a heap dump. (Of course the first time I tried this I got a permission denied error on writing out the dump, and had to restart C* as root, but anyway ...) Below you'll find a screen grab of the heap dump analysis. http://screencast.com/t/U6IYzloe2McP Here is what I see in cassandra.log just prior to OOM: [ec2-user@ip-10-86-223-245 ~]$ tail -f /var/log/cassandra/cassandra.log INFO 22:37:11,193 Removing compacted SSTable files (see http://wiki.apache.org/cassandra/MemtableSSTable) INFO 22:37:11,194 Removing compacted SSTable files (see http://wiki.apache.org/cassandra/MemtableSSTable) INFO 22:37:11,195 Removing compacted SSTable files (see http://wiki.apache.org/cassandra/MemtableSSTable) INFO 22:37:11,196 Removing compacted SSTable files (see http://wiki.apache.org/cassandra/MemtableSSTable) INFO 22:37:11,212 Removing compacted SSTable files (see http://wiki.apache.org/cassandra/MemtableSSTable) INFO 22:38:32,485 Opening /cassandra/data/Keyspace1/TwitterTest-g-5852 INFO 22:38:33,253 Opening /cassandra/data/Keyspace1/TwitterTest-g-5502 INFO 22:38:34,710 Opening /cassandra/data/Keyspace1/TwitterTest-g-5643 INFO 22:38:35,653 Opening /cassandra/data/Keyspace1/TwitterTest-g-6117 INFO 22:38:35,699 Opening /cassandra/data/Keyspace1/TwitterTest-g-1376 [Unloading class sun.reflect.GeneratedSerializationConstructorAccessor27] ...etc... >From my org.apache.cassandra.db MBean I get an estimate of 596,504,576 keys. ----- What I'd really like to know is: 1. What operation is C* performing during lines like these: INFO 22:38:34,710 Opening /cassandra/data/Keyspace1/TwitterTest-g-5643 (I think this is an SSTable it's extracting an index for this column family from) 2. Has my CF index outgrown memory? 3. If so is there a way to relate # CF, # Columns, # Rows to index size? I need to know how many keys I can store before I need more memory, or need more nodes. Thanks in advance. I've been getting a lot of help from the list and I really appreciate it! Ian