Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C41C910EF0 for ; Wed, 26 Jun 2013 05:30:04 +0000 (UTC) Received: (qmail 52050 invoked by uid 500); 26 Jun 2013 05:29:59 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 51800 invoked by uid 500); 26 Jun 2013 05:29:58 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 51768 invoked by uid 99); 26 Jun 2013 05:29:56 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Jun 2013 05:29:56 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a80.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Jun 2013 05:29:52 +0000 Received: from homiemail-a80.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a80.g.dreamhost.com (Postfix) with ESMTP id 9010137A06F for ; Tue, 25 Jun 2013 22:29:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; s= thelastpickle.com; bh=KtLzgZRSSp3PM8PZ50V5NjPb3kU=; b=3fIT1aXUmK xcD2JOiBBkOfE/LmLeqLhEByDSgWqCDuxyb0Ksi25i2nx5xV5o25JNwIZajt9hkC 3UZZpcG9jKhFl4SRtAoQaQ0u4OnWlihgFG8CyiPHZqpb8FokIGoSRVTso6ZB7otf J3tCBlz3ApsMZoJ4lrnIXRBpmy0chqYB4= Received: from [172.16.1.7] (unknown [203.86.207.101]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a80.g.dreamhost.com (Postfix) with ESMTPSA id 3B36C37A065 for ; Tue, 25 Jun 2013 22:29:30 -0700 (PDT) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\)) Subject: Re: about FlushWriter "All time blocked" From: aaron morton In-Reply-To: Date: Wed, 26 Jun 2013 17:29:28 +1200 Content-Transfer-Encoding: quoted-printable Message-Id: <75BBE118-E9EC-437E-9F4C-1D3364216AC4@thelastpickle.com> References: To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1508) X-Virus-Checked: Checked by ClamAV on apache.org > FlushWriter 0 0 191 0 = 12 This means there were 12 times the code wanted to put an memtable in the = queue to be flushed to disk but the queue was full.=20 The length of this queue is controlled by the memtable_flush_queue_size = https://github.com/apache/cassandra/blob/cassandra-1.2/conf/cassandra.yaml= #L299 and memtable_flush_writers . When this happens an internal lock around the commit log is held which = prevents writes from being processed.=20 In general it means the IO system cannot keep up. It can sometimes = happen when snapshot is used as all the CF's are flushed to disk at = once. I also suspect it happens sometimes when a commit log segment is = flushed and their are a lot of dirty CF's. But i've never proved it.=20 Increase memtable_flush_queue_size following the help in the yaml file. = If you do not use secondary indexes are you using snapshot? Hope that helps.=20 A ----------------- Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 24/06/2013, at 3:41 PM, yue.zhang wrote: > 3 node > cent os > CPU 8core memory 32GB > cassandra 1.2.5 > my scenario: many counter incr, every node has one client program, = performance is 400 wps /every clicent (it=E2=80=99s so slowly) > =20 > my question=EF=BC=9A > =C3=98 nodetool tpstats > --------------------------------- > Pool Name Active Pending Completed Blocked = All time blocked > ReadStage 0 0 8453 0 = 0 > RequestResponseStage 0 0 138303982 0 = 0 > MutationStage 0 0 172002988 0 = 0 > ReadRepairStage 0 0 0 0 = 0 > ReplicateOnWriteStage 0 0 82246354 0 = 0 > GossipStage 0 0 1052389 0 = 0 > AntiEntropyStage 0 0 0 0 = 0 > MigrationStage 0 0 0 0 = 0 > MemtablePostFlusher 0 0 670 0 = 0 > FlushWriter 0 0 191 0 = 12 > MiscStage 0 0 0 0 = 0 > commitlog_archiver 0 0 0 0 = 0 > InternalResponseStage 0 0 0 0 = 0 > HintedHandoff 0 0 56 0 = 0 > ----------------------------------- > FlushWriter =E2=80=9CAll time blocked=E2=80=9D=3D12=EF=BC=8CI restart = the node=EF=BC=8Cbut no use=EF=BC=8Cit=E2=80=99s normally ? > =20 > thx > =20 > -heipark > =20 > =20