Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8F9917CA3 for ; Sat, 10 Sep 2011 22:15:31 +0000 (UTC) Received: (qmail 33623 invoked by uid 500); 10 Sep 2011 22:15:29 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 33593 invoked by uid 500); 10 Sep 2011 22:15:29 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 33585 invoked by uid 99); 10 Sep 2011 22:15:29 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Sep 2011 22:15:29 +0000 X-ASF-Spam-Status: No, hits=-0.6 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jeremy.hanna1234@gmail.com designates 209.85.160.172 as permitted sender) Received: from [209.85.160.172] (HELO mail-gy0-f172.google.com) (209.85.160.172) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Sep 2011 22:15:22 +0000 Received: by gyd12 with SMTP id 12so2778794gyd.31 for ; Sat, 10 Sep 2011 15:15:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=nLsMcy25u4vDVXlb+0o/4q3NjorbO0ZZEXz+4jRsCD4=; b=wn8VuJreeli53vDLz2FeQ6Mdk0L/Z0z3oDL3ChLwmt5H9nJb+gz2nZPhu1uEZrTE1h 3iGfwOH0pRE/VOAWTHTxob13uKGmk86iS59B2Ln2WiRGk9t63qB4g7Fw2NYakeyI/eX2 HDKck6K1KzyXkci9KrGAAl+gok7tfdPe6kxdU= Received: by 10.150.31.20 with SMTP id e20mr3103674ybe.239.1315692900039; Sat, 10 Sep 2011 15:15:00 -0700 (PDT) Received: from [192.168.1.69] (108-90-0-32.lightspeed.austtx.sbcglobal.net [108.90.0.32]) by mx.google.com with ESMTPS id v11sm358389ybv.15.2011.09.10.15.14.58 (version=TLSv1/SSLv3 cipher=OTHER); Sat, 10 Sep 2011 15:14:58 -0700 (PDT) Content-Type: text/plain; charset=iso-8859-1 Mime-Version: 1.0 (Apple Message framework v1244.3) Subject: Re: Massive writes when only reading from Cassandra From: Jeremy Hanna In-Reply-To: Date: Sat, 10 Sep 2011 17:14:57 -0500 Content-Transfer-Encoding: quoted-printable Message-Id: <05BCECCE-098A-4205-8C0E-3ADCDF1C0EC2@gmail.com> References: <95F4FC21-8D11-4375-8503-1A60BD1AD311@gmail.com> <1BDB35E4-A61B-4F91-9257-D6F5119F29AB@gmail.com> <4ECB8F33-97BE-42D5-A435-707F04EE071B@gmail.com> To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1244.3) X-Virus-Checked: Checked by ClamAV on apache.org Thanks for the insights. I may first try disabling hinted handoff for = one run of our data pipeline and see if it exhibits the same behavior. = Will post back if I see anything enlightening there. On Sep 10, 2011, at 5:04 PM, Chris Goffinet wrote: > You could tail the commit log with `strings` to see what keys are = being inserted. >=20 > On Sat, Sep 10, 2011 at 2:24 PM, Jonathan Ellis = wrote: > Two possibilities: >=20 > 1) Hinted handoff (this will show up in the logs on the sending > machine, on the receiving one it will just look like any other write) >=20 > 2) You have something doing writes that you're not aware of, I guess > you could track that down using wireshark to see where the write > messages are coming from >=20 > On Sat, Sep 10, 2011 at 3:56 PM, Jeremy Hanna > wrote: > > Oh and we're running 0.8.4 and the RF is 3. > > > > On Sep 10, 2011, at 3:49 PM, Jeremy Hanna wrote: > > > >> In addition, the mutation stage and the read stage are backed up = like: > >> > >> Pool Name Active Pending Blocked > >> ReadStage 32 773 0 > >> RequestResponseStage 0 0 0 > >> ReadRepairStage 0 0 0 > >> MutationStage 158 525918 0 > >> ReplicateOnWriteStage 0 0 0 > >> GossipStage 0 0 0 > >> AntiEntropyStage 0 0 0 > >> MigrationStage 0 0 0 > >> StreamStage 0 0 0 > >> MemtablePostFlusher 1 5 0 > >> FILEUTILS-DELETE-POOL 0 0 0 > >> FlushWriter 2 5 0 > >> MiscStage 0 0 0 > >> FlushSorter 0 0 0 > >> InternalResponseStage 0 0 0 > >> HintedHandoff 0 0 0 > >> CompactionManager n/a 29 > >> MessagingService n/a 0,34 > >> > >> On Sep 10, 2011, at 3:38 PM, Jeremy Hanna wrote: > >> > >>> We are experiencing massive writes to column families when only = doing reads from Cassandra. A set of 5 hadoop jobs are reading from = Cassandra and then writing out to hdfs. That is the only thing = operating on the cluster. We are reading at CL.QUORUM with hadoop and = have written with CL.QUORUM. Read repair chance is set to 0.0 on all = column families. However, in the logs, I'm seeing flush after flush of = memtables and compactions taking place. Is there something else that = would be writing based on the above description? > >>> > >>> Jeremy > >> > > > > >=20 >=20 >=20 > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of DataStax, the source for professional Cassandra support > http://www.datastax.com >=20