Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 60E0342D1 for ; Tue, 21 Jun 2011 11:40:54 +0000 (UTC) Received: (qmail 78182 invoked by uid 500); 21 Jun 2011 11:40:51 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 78154 invoked by uid 500); 21 Jun 2011 11:40:51 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 78140 invoked by uid 99); 21 Jun 2011 11:40:51 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Jun 2011 11:40:51 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a50.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 21 Jun 2011 11:40:45 +0000 Received: from homiemail-a50.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a50.g.dreamhost.com (Postfix) with ESMTP id CA8806F8062 for ; Tue, 21 Jun 2011 04:40:24 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=content-type :mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; q=dns; s= thelastpickle.com; b=3RWH1XIbM1QJPcqNEPGJRSleJAUHaNp73h11D7rGGWk iuVAUPuC0d1aC8bh5CeVM+2Qo+Fg/yDXJK5OXaBWjI0SDLRJ6cHj7a9SY8bvsfVC dlWrLsr779VnomZ9uGzmWuTS74DIBHs1iv/M8/ud+OdTVKKbV9G/KudZWZBI4poQ = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; s= thelastpickle.com; bh=k2FPl3v+1RkyuXgjkKINyLvN4o4=; b=dhMKk4Vtvb kWJzVLb/U/42MMe+JBygvvdhS4RLa2Ww1bReY+ENnZlB8j2p22KPpamL2GYLlaH/ Iw7AG+ItXP0N6cGAip1UkxFS7ItyVe8eWazE4xksgqe8EWZJwbiEy2KPEGxgg4ID Kcr5pRX8tXinq3BXEGwbyFTGaiD6ZhfPs= Received: from [10.0.1.151] (121-73-157-230.cable.telstraclear.net [121.73.157.230]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a50.g.dreamhost.com (Postfix) with ESMTPSA id 5896B6F8059 for ; Tue, 21 Jun 2011 04:40:24 -0700 (PDT) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1084) Subject: Re: OOM during restart From: aaron morton In-Reply-To: Date: Tue, 21 Jun 2011 23:40:21 +1200 Content-Transfer-Encoding: quoted-printable Message-Id: References: To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1084) AFAIK the node will not announce itself in the ring until the log replay = is complete, so it will not get the schema update until after log = replay. If possible i'd avoid making the schema change until you have = solved this problem. My theory on OOM during log replay is that the high speed inserts are a = good way of finding out if the maximum memory required by the schema is = too big to fit in the JVM. How big is the max JVM Heap SIze and do you = have a lot of CF's? The simple solution it to either (temporarily) increase the JVM Heap = Size or move the log files so that the server can process only one at a = time. The JVM option D.cassandra_ring=3Dfalse will stop the node from = joining the cluster and stop other nodes sending requests to it until = you have sorted it out.=20 Hope that helps.=20 =20 =20 ----------------- Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 21 Jun 2011, at 10:24, Gabriel Ki wrote: > Hi, >=20 > Cassandra: 7.6-2 > I was restarting a node and ran into OOM while replaying the commit = log. I am not able to bring the node up again. >=20 > DEBUG 15:11:43,501 forceFlush requested but everything is clean = <-------- For this I don't know what to do. > java.lang.OutOfMemoryError: Java heap space > at = org.apache.cassandra.io.util.BufferedRandomAccessFile.(BufferedRando= mAccessFile.java:123) > at = org.apache.cassandra.io.sstable.SSTableWriter$IndexWriter.(SSTableWr= iter.java:395) > at = org.apache.cassandra.io.sstable.SSTableWriter.(SSTableWriter.java:76= ) > at = org.apache.cassandra.db.ColumnFamilyStore.createFlushWriter(ColumnFamilySt= ore.java:2238) > at = org.apache.cassandra.db.Memtable.writeSortedContents(Memtable.java:166) > at org.apache.cassandra.db.Memtable.access$000(Memtable.java:49) > at = org.apache.cassandra.db.Memtable$1.runMayThrow(Memtable.java:189) > at = org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30) > at = java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.= java:886) > at = java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java= :908) > at java.lang.Thread.run(Thread.java:662) >=20 > Any help will be appreciated. =20 >=20 > If I update the schema while a node is down, the new schema is loaded = before the flushing when the node is brought up again, correct? =20 >=20 > Thanks, > -gabe