Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CBB6010319 for ; Wed, 27 Nov 2013 01:48:20 +0000 (UTC) Received: (qmail 91513 invoked by uid 500); 27 Nov 2013 01:48:18 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 91485 invoked by uid 500); 27 Nov 2013 01:48:18 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 91477 invoked by uid 99); 27 Nov 2013 01:48:18 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Nov 2013 01:48:18 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy includes SPF record at spf.trusted-forwarder.org) Received: from [209.85.160.49] (HELO mail-pb0-f49.google.com) (209.85.160.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Nov 2013 01:48:12 +0000 Received: by mail-pb0-f49.google.com with SMTP id jt11so9147907pbb.8 for ; Tue, 26 Nov 2013 17:47:50 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:content-type:mime-version:subject:from :in-reply-to:date:content-transfer-encoding:message-id:references:to; bh=SdRreFOxF9xPA9JU4etJg8iCmFBtRIripq44zG2Qn00=; b=Rv6Dv2cWt3bizbgfpdSSZ80Cn5QCjlzXiTdxKoIa2vRCfZqAqMz6HkY2E4SBCi45Iy F+5rdCRjYhHvlzvRCsJAx4c4K0A5uRcEN8u7AmNQDhCi4shrbHlQNc2N42tyNC4U71aa mg1HCK9QTcBp5Ok49i1MvK1BfLMjDF/M87y/ID5PniX/uBXeGg8EviTi6zf9T+7EoToW i0YTjUap4fJmetb4nkf1EIA1Bl61QRGOi+Xu1psaXirPGIxgUCR+4r68ymz5mULqcbUx DvjOfvpQef3rWFqoPk4aZhUo+NE+Wo6ea/wF+Oklm5pCyjguggUaZI/JXA39wbkCXj3H gg8Q== X-Gm-Message-State: ALoCoQkfVvdb3bUfaALfRvhapdYTVyR3tG+3aKEmlLFOhpYnfiZ/IFrX7+9IYuZEK4zMO6TBpkNN X-Received: by 10.68.231.68 with SMTP id te4mr2424328pbc.174.1385516870548; Tue, 26 Nov 2013 17:47:50 -0800 (PST) Received: from [172.16.1.18] ([203.86.207.101]) by mx.google.com with ESMTPSA id ye1sm95015891pab.19.2013.11.26.17.47.48 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 26 Nov 2013 17:47:49 -0800 (PST) Content-Type: text/plain; charset=windows-1252 Mime-Version: 1.0 (Mac OS X Mail 7.0 \(1822\)) Subject: Re: 1.1.11: system keyspace is filling up From: Aaron Morton In-Reply-To: Date: Wed, 27 Nov 2013 14:47:41 +1300 Content-Transfer-Encoding: quoted-printable Message-Id: References: To: Cassandra User X-Mailer: Apple Mail (2.1822) X-Virus-Checked: Checked by ClamAV on apache.org > What happens if they are not being successfully delivered ? Will they = eventually TTL-out ? They have a TTL set to the gc_grace_seconds on the CF at the time of the = write.=20 I=92ve also seen hints build up in multi DC systems due to timeouts on = the coordinator. i.e. the remote nodes are up, co-ordinator starts the = writes, remote nodes process the request (no dropped messages), but the = response is lost. These are tracked as timeouts on the = MessagingServiceMBean.=20 Cheers ----------------- Aaron Morton New Zealand @aaronmorton Co-Founder & Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 22/11/2013, at 6:00 pm, Rahul Menon wrote: > Oleg,=20 >=20 > The system keyspace is not replicated it is local to the node. You = should check your logs to see if there are Timeouts from streaming = hints, i believe the default value to stream hints it 10 seconds. When i = ran into this problem i truncated hints to clear out the space and then = ran a repair so ensure that all the data was consistant across all = nodes, even if there was a failure.=20 >=20 > -rm=20 >=20 >=20 > On Tue, Nov 5, 2013 at 6:29 PM, Oleg Dulin = wrote: > What happens if they are not being successfully delivered ? Will they = eventually TTL-out ? >=20 >=20 >=20 > Also, do I need to truncate hints on every node or is it replicated ? >=20 >=20 >=20 > Oleg >=20 >=20 >=20 > On 2013-11-04 21:34:55 +0000, Robert Coli said: >=20 >=20 >=20 > On Mon, Nov 4, 2013 at 11:34 AM, Oleg Dulin = wrote: >=20 > I have a dual DC setup, 4 nodes, RF=3D4 in each. >=20 >=20 >=20 > The one that is used as primary has its system keyspace fill up with = 200 gigs of data, majority of which is hints. >=20 >=20 >=20 > Why does this happen ? >=20 >=20 >=20 > How can I clean it up ? >=20 >=20 >=20 > If you have this many hints, you probably have flapping / frequent = network partition, or very overloaded nodes. If you compare the number = of hints to the number of dropped messages, that would be informative. = If you're hinting because you're dropping, increase capacity. If you're = hinting because of partition, figure out why there's so much partition. >=20 >=20 >=20 > WRT cleaning up hints, they will automatically be cleaned up = eventually, as long as they are successfully being delivered. If you = need to manually clean them up you can truncate system.hints keyspace. >=20 >=20 >=20 > =3DRob >=20 > =20 >=20 >=20 >=20 >=20 >=20 >=20 > --=20 >=20 > Regards, >=20 > Oleg Dulin >=20 > http://www.olegdulin.com >=20 >=20