Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 4323 invoked from network); 23 Apr 2010 21:14:12 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 23 Apr 2010 21:14:12 -0000 Received: (qmail 88268 invoked by uid 500); 23 Apr 2010 21:14:12 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 88247 invoked by uid 500); 23 Apr 2010 21:14:12 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 88239 invoked by uid 99); 23 Apr 2010 21:14:12 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Apr 2010 21:14:12 +0000 X-ASF-Spam-Status: No, hits=-1339.4 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 Apr 2010 21:14:11 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o3NLDp3P013934 for ; Fri, 23 Apr 2010 21:13:51 GMT Message-ID: <6873742.167471272057231069.JavaMail.jira@thor> Date: Fri, 23 Apr 2010 17:13:51 -0400 (EDT) From: "Brandon Williams (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Commented: (CASSANDRA-1014) GC storming, possible memory leak In-Reply-To: <32174055.140911271953191087.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12860375#action_12860375 ] Brandon Williams commented on CASSANDRA-1014: --------------------------------------------- Confirmed, 100M inserts works with this patch, though CMS still had tons of concurrent mode failures. 18 minutes of GC time on ParNew, 1 hour 26 minutes on CMS. > GC storming, possible memory leak > --------------------------------- > > Key: CASSANDRA-1014 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1014 > Project: Cassandra > Issue Type: Bug > Components: Core > Affects Versions: 0.6 > Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode) > Reporter: Brandon Williams > Assignee: Jonathan Ellis > Fix For: 0.6.2 > > Attachments: 1014-commitlog-v2.tar.gz, 1014-commitlog.tar.gz, 1014-table.diff, 1014.txt, 724-0001.png > > > There appears to be a GC issue due to memory pressure in the 0.6 branch. You can see this by starting the server and performing many inserts. Quickly the jvm will consume most of its heap, and pauses for stop-the-world GC will begin. With verbose GC turned on, this can be observed as follows: > [GC [ParNew (promotion failed): 79703K->79703K(84544K), 0.0622980 secs][CMS[CMS-concurrent-mark: 3.678/5.031 secs] [Times: user=10.35 sys=4.22, real=5.03 secs] > (concurrent mode failure): 944529K->492222K(963392K), 2.8264480 secs] 990745K->492222K(1047936K), 2.8890500 secs] [Times: user=2.90 sys=0.04, real=2.90 secs] > After enough inserts (around 75-100 million) the server will GC storm and then OOM. > jbellis and I narrowed this down to patch 0001 in CASSANDRA-724. Switching LBQ with ABQ made no difference, however using batch mode instead of periodic for the commitlog does prevent the issue from occurring. The attached screenshot shows the heap usage in jconsole first when the issue is exhibiting, a restart, and then the same amount of inserts when it does not. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.