Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DE81D6B82 for ; Mon, 25 Jul 2011 13:18:15 +0000 (UTC) Received: (qmail 12624 invoked by uid 500); 25 Jul 2011 13:18:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 12546 invoked by uid 500); 25 Jul 2011 13:18:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 12538 invoked by uid 99); 25 Jul 2011 13:18:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jul 2011 13:18:13 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of springrider@gmail.com designates 209.85.215.180 as permitted sender) Received: from [209.85.215.180] (HELO mail-ey0-f180.google.com) (209.85.215.180) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jul 2011 13:18:07 +0000 Received: by eyg24 with SMTP id 24so3026742eyg.25 for ; Mon, 25 Jul 2011 06:17:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=8LYycZs55epDA4HlLGnnpKJN0QBaICwzfOZfimEKD+4=; b=MkCsWlUSnx8TJ27nQuKRE3KukLIuQDtvScGIwSAZw+qYSBwEeoNbN9tbgLWJdzaK3i geagvoRux+qPA5MAIv06asH7JqJpeBjasp0e6KS08o0yYQzDPkNnpEHmMrbGfpOcgZzb VX8hnaWr47TQPWENU60tCSnyxLxaIChpbHeLE= Received: by 10.213.15.148 with SMTP id k20mr868316eba.72.1311599867079; Mon, 25 Jul 2011 06:17:47 -0700 (PDT) MIME-Version: 1.0 Received: by 10.213.23.9 with HTTP; Mon, 25 Jul 2011 06:17:27 -0700 (PDT) In-Reply-To: References: <4B50E052B4E82B4AA6961853D610D3C433B62890ED@MBX75.ad2.softcom.biz> From: Yan Chunlu Date: Mon, 25 Jul 2011 21:17:27 +0800 Message-ID: Subject: Re: do I need to add more nodes? minor compaction eat all IO To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org as the wiki suggested: http://wiki.apache.org/cassandra/LargeDataSetConsiderations Adding nodes is a slow process if each node is responsible for a large amount of data. Plan for this; do not try to throw additional hardware at a cluster at the last minute. I really would like to know what's the status of my cluster, if it is norma= l On Mon, Jul 25, 2011 at 8:59 PM, Yan Chunlu wrote: > I am using normal SATA disk, =A0actually I was worrying about whether it > is okay if every time cassandra using all the io resources? > further more when is the good time to add more nodes when I was just > using normal SATA disk and with 100r/s it could reach 100 %util.... > > how large the data size it should be on each node? > > > below is my iostat -x 2 when doing node repair, I have to repair > column family separately otherwise the load will be more crazy: > > Device: =A0 =A0 =A0 =A0 rrqm/s =A0 wrqm/s =A0 =A0 r/s =A0 =A0 w/s =A0 =A0= rMB/s =A0 =A0wMB/s > avgrq-sz avgqu-sz =A0 await r_await w_await =A0svctm =A0%util > sda =A0 =A0 =A0 =A0 =A0 =A0 =A0 1.50 =A0 =A0 1.50 =A0121.50 =A0 14.00 =A0= =A0 3.68 =A0 =A0 0.30 > 60.19 =A0 116.98 1569.46 =A0 59.49 14673.86 =A0 7.38 100.00 > > > > > > > On Sun, Jul 24, 2011 at 8:04 AM, Jonathan Ellis wrote= : >> On Sat, Jul 23, 2011 at 4:16 PM, Francois Richard w= rote: >>> My understanding is that during compaction cassandra does a lot of non = sequential readsa then dumps the results with a big sequential write. >> >> Compaction reads and writes are both sequential, and 0.8 allows >> setting a MB/s to cap compaction at. >> >> As to the original question "do I need to add more machines" I'd say >> that depends more on whether your application's SLA is met, than what >> % io util spikes to. >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of DataStax, the source for professional Cassandra support >> http://www.datastax.com >> >