Date: Tue, 16 Feb 2010 10:28:37 -0800
Message-ID: <5f7770581002161028s21c54cdfk846e2c973d06aaf6@mail.gmail.com>
Subject: Re: cassandra freezes
From: Tatu Saloranta
To: cassandra-user@incubator.apache.org

On Tue, Feb 16, 2010 at 6:25 AM, Boris Shulman wrote:
> Hello, I'm running some benchmarks on 2 cassandra nodes each running
> on 8 cores machine with 16G RAM, 10G for Java heap. I've noticed that
> during benchmarks with numerous writes cassandra just freeze for
> several minutes (in those benchmarks I'm writing batches of 10 columns
> with 1K data each for every key in a single CF). Usually after
> performing 50K writes I'm getting a TimeOutException and cassandra
> just freezes. What configuration changes can I make in order to
> prevent this? Is it possible that my setup just can't handle the load?
> How can I calculate the number of casandra nodes for a desired load?

One thing that can cause apparent lockups is the garbage collector, so
enabling GC debug output would be helpful for seeing GC activity. Some
collectors (CMS specifically) can stop the system for a very long time,
up to minutes. This is not necessarily the root cause, but it is easy
to rule out.

Beyond this, getting a stack trace during a lockup would make sense.
That can pinpoint what the threads are doing, or what they are blocked
on in case there is a deadlock or heavy contention on some shared
resource.

-+ Tatu +-
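
To make the two checks above concrete: on HotSpot JVMs of this era, GC logging is normally switched on with startup flags such as -verbose:gc and -XX:+PrintGCDetails, and a thread dump can be taken with jstack <pid> or by sending SIGQUIT (kill -3 <pid>). The same information is also exposed through the standard java.lang.management API. What follows is only a minimal sketch of that API, not anything Cassandra ships; the class name JvmStallProbe and its two methods are made up for illustration, and it inspects the JVM it runs in rather than a remote Cassandra process (for that you would go through JMX or the command-line tools).

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

// Hypothetical helper: reports GC activity and thread state for the current JVM.
final class JvmStallProbe {

    // One line per collector with cumulative collection count and time.
    // A large jump in totalTimeMs between two calls suggests a long GC pause.
    public static String gcReport() {
        StringBuilder sb = new StringBuilder();
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            sb.append(gc.getName())
              .append(": collections=").append(gc.getCollectionCount())
              .append(", totalTimeMs=").append(gc.getCollectionTime())
              .append('\n');
        }
        return sb.toString();
    }

    // Deadlock count followed by the state and stack of every live thread.
    public static String threadReport() {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();
        boolean monitors = threads.isObjectMonitorUsageSupported();
        boolean synchronizers = threads.isSynchronizerUsageSupported();

        // Both calls return null when no deadlock is found.
        long[] deadlocked = synchronizers
                ? threads.findDeadlockedThreads()
                : threads.findMonitorDeadlockedThreads();

        StringBuilder sb = new StringBuilder();
        sb.append("deadlockedThreads=")
          .append(deadlocked == null ? 0 : deadlocked.length)
          .append('\n');

        for (ThreadInfo info : threads.dumpAllThreads(monitors, synchronizers)) {
            sb.append('"').append(info.getThreadName()).append("\" ")
              .append(info.getThreadState());
            if (info.getLockName() != null) {
                // The lock or condition this thread is blocked or waiting on.
                sb.append(" waiting on ").append(info.getLockName());
            }
            sb.append('\n');
            for (StackTraceElement frame : info.getStackTrace()) {
                sb.append("    at ").append(frame).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.print(gcReport());
        System.out.print(threadReport());
    }
}
```

Calling gcReport() before and after a freeze and comparing the totals gives a rough answer to "was it GC?", while threadReport() taken during the freeze shows which threads are BLOCKED or WAITING and on what.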