From: Jonathan Ellis
Date: Wed, 30 Jun 2010 15:59:39 -0700
Subject: Re: Cluster-wide pause
To: user@cassandra.apache.org

use a larger heap or throttle your writes

On Wed, Jun 30, 2010 at 12:12 PM, Steve Lihn wrote:
> Jon,
> I am experimenting writing 8 million rows into Cassandra and also
> experienced some random timeouts, even with 10-second timeout parameter.
> How do I avoid such timeout at all cost? (I.e. At this time, my priority is
> to finish the end-to-end test. Don't want the program to fail at all.)
> This is a one-node server with ms1G and mx2GB. The rest in cassandra.in.sh
> are default I think.
>
> Steve
>
> On Wed, Jun 30, 2010 at 1:58 AM, Jonathan Ellis wrote:
>>
>> You could be seeing GC pauses. Did you increase the heap size you gave
>> Cassandra, when you increased your VM size?
>>
>> On Tue, Jun 29, 2010 at 11:57 AM, Oren Benjamin
>> wrote:
>> > Hi all - first timer here.
>> >
>> > I'm experimenting with Cassandra on Rackspace Cloud. Started with 4GB
>> > nodes and saw read latency spikes while streaming was taking place, so I
>> > increased to 8GB to see if limited memory was the issue. Now I'm seeing
>> > very strange behavior during any period that writes are taking place. The
>> > entire (6 node) cluster seems to pause for periods of as much as 5-8 sec.
>> > By that I mean all the stats (cpu, disk, network IO monitored via dstat)
>> > drop to zero or near zero on all nodes simultaneously. Does anyone have
>> > experience with Cassandra on Rackspace or any idea what's going on here?
>> >
>> > The pauses are short enough that it's difficult to introspect the
>> > application and determine what it's doing during the pause, but long enough
>> > to cause unacceptable latency for any service built on top of it.
>> >
>> > Any ideas or debugging methods would be greatly appreciated,
>> >
>> > -- Oren
>>
>>
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of Riptano, the source for professional Cassandra support
>> http://riptano.com
>
>

-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com
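
The "throttle your writes" suggestion in this thread could be sketched on the client side roughly as follows. This is an illustration only, not code from the thread: `write_row` is a hypothetical stand-in for whatever client call performs the insert, it is assumed to raise Python's built-in `TimeoutError` on a timeout, and the rate and retry values are placeholders to tune by experiment.

```python
import time


class Throttle:
    """Fixed-rate limiter: lets at most `rate` calls through per second."""

    def __init__(self, rate, clock=time.monotonic, sleep=time.sleep):
        self.interval = 1.0 / rate      # minimum spacing between calls
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = clock()     # earliest time the next call may run

    def wait(self):
        """Block until the next call is allowed, then reserve the next slot."""
        now = self.clock()
        delay = self.next_allowed - now
        if delay > 0:
            self.sleep(delay)
        self.next_allowed = max(now, self.next_allowed) + self.interval


def write_with_retry(write_row, row, throttle,
                     max_retries=5, base_delay=0.5, sleep=time.sleep):
    """Write one row at a throttled rate, backing off after each timeout.

    `write_row` is a hypothetical callable supplied by the caller; it is
    assumed to raise TimeoutError when the server does not answer in time.
    """
    for attempt in range(max_retries + 1):
        throttle.wait()
        try:
            return write_row(row)
        except TimeoutError:
            if attempt == max_retries:
                raise                   # give up after the last retry
            # Exponential backoff gives the server time to recover.
            sleep(base_delay * 2 ** attempt)
```

A bulk load would create one `Throttle` (for example `Throttle(rate=500)`, an arbitrary figure) and call `write_with_retry` for each row. Lowering `rate` trades throughput for fewer timeouts.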