Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F1E891168C for ; Wed, 18 Jun 2014 15:14:31 +0000 (UTC) Received: (qmail 21211 invoked by uid 500); 18 Jun 2014 15:14:29 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 21175 invoked by uid 500); 18 Jun 2014 15:14:29 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 21165 invoked by uid 99); 18 Jun 2014 15:14:29 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jun 2014 15:14:29 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of marcelo@s1mbi0se.com.br designates 209.85.213.177 as permitted sender) Received: from [209.85.213.177] (HELO mail-ig0-f177.google.com) (209.85.213.177) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Jun 2014 15:14:25 +0000 Received: by mail-ig0-f177.google.com with SMTP id c1so900605igq.4 for ; Wed, 18 Jun 2014 08:13:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=s1mbi0se.com.br; s=google; h=mime-version:date:message-id:subject:from:to:content-type; bh=o2oiXcs/ffzuVt36WZSlb1gWvLwGUHAuhfq4EZR9CdY=; b=R6iV+DfcplGhwHVoWIKy/IG+Ex8+1U43l4+4TA/41nCd9AmYkbIHDgmEBEG2bM4gPW TjnXam9l7XVoD6PmoDVX32PcFuC79o4Zpp7x4YludBgbUCucfrmbc2zUEhCczuqfroms 0CXywQPwiWsPZ1xbA17GGOpWegDVynyN5icwc= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:date:message-id:subject:from:to :content-type; bh=o2oiXcs/ffzuVt36WZSlb1gWvLwGUHAuhfq4EZR9CdY=; b=NN2AwGxp0IY7rzho4q4LHNI8ZEBllpKpvzUgXseW7KAjdtPpQesQkCuwUxnyWi7DVR uFTiyYsjLg4lkvkXva+tswIgxJeoyfuEN3ynnoT1pMXme/7Ewbw79gNEZEmYzfEDbcfV /wpgKwwhjT4f1nRIlhehmKg8AfyG7peqJuFNYKp1zNQUnyJvh6g6e38KLRV2fwG5jCJR jWlDzA1B2QrXnvHmuioncTROy7U+FuEqo7QchTv6kfb70ER2whQ53Vd4ekZS2OgDvbO0 RpNugbBIx4pLQ4aQJdscNsaOffXfR0qEtRCZQaW5makMTu2b5nrnqwLDVzCM1OVlaqs+ YdDQ== X-Gm-Message-State: ALoCoQls1Z8IBe9FtpJTFIjGQDR6dfnJPA5o7jm65vyixLPnXjeolIjr+qZkxmPS0E3r/hY5EXmm MIME-Version: 1.0 X-Received: by 10.50.23.105 with SMTP id l9mr5956202igf.46.1403104439473; Wed, 18 Jun 2014 08:13:59 -0700 (PDT) Received: by 10.64.16.233 with HTTP; Wed, 18 Jun 2014 08:13:59 -0700 (PDT) X-Originating-IP: [189.101.187.145] Date: Wed, 18 Jun 2014 12:13:59 -0300 Message-ID: Subject: Batch of prepared statements exceeding specified threshold From: Marcelo Elias Del Valle To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=089e01538e6cb3143604fc1db5bc X-Virus-Checked: Checked by ClamAV on apache.org --089e01538e6cb3143604fc1db5bc Content-Type: text/plain; charset=UTF-8 I have a 10 node cluster with cassandra 2.0.8. I am taking this exceptions in the log when I run my code. What my code does is just reading data from a CF and in some cases it writes new data. WARN [Native-Transport-Requests:553] 2014-06-18 11:04:51,391 BatchStatement.java (line 228) Batch of prepared statements for [identification1.entity, identification1.entity_lookup] is of size 6165, exceeding specified threshold of 5120 by 1045. WARN [Native-Transport-Requests:583] 2014-06-18 11:05:01,152 BatchStatement.java (line 228) Batch of prepared statements for [identification1.entity, identification1.entity_lookup] is of size 21266, exceeding specified threshold of 5120 by 16146. WARN [Native-Transport-Requests:581] 2014-06-18 11:05:20,229 BatchStatement.java (line 228) Batch of prepared statements for [identification1.entity, identification1.entity_lookup] is of size 22978, exceeding specified threshold of 5120 by 17858. INFO [MemoryMeter:1] 2014-06-18 11:05:32,682 Memtable.java (line 481) CFS(Keyspace='OpsCenter', ColumnFamily='rollups300') liveRatio is 14.249755859375 (just-counted was 9.85302734375). calculation took 3ms for 1024 cells After some time, one node of the cluster goes down. Then it goes back after some seconds and another node goes down. It keeps happening and there is always a node down in the cluster, when it goes back another one falls. The only exceptions I see in the log is "connected reset by the peer", which seems to be relative to gossip protocol, when a node goes down. Any hint of what could I do to investigate this problem further? Best regards, Marcelo Valle. --089e01538e6cb3143604fc1db5bc Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
I have a 10 node cluster with cassandra 2.0.8.
<= div>
I am taking this exceptions in the log when I run my cod= e. What my code does is just reading data from a CF and in some cases it wr= ites new data.

=C2=A0WARN [Native-Transport-Requests:553] 2014-06-18 1= 1:04:51,391 BatchStatement.java (line 228) Batch of prepared statements for= [identification1.entity, identification1.entity_lookup] is of size 6165, e= xceeding specified threshold of 5120 by 1045.
=C2=A0WARN [Native-Transport-Requests:583] 2014-06-18 11:05:01,152 Bat= chStatement.java (line 228) Batch of prepared statements for [identificatio= n1.entity, identification1.entity_lookup] is of size 21266, exceeding speci= fied threshold of 5120 by 16146.
=C2=A0WARN [Native-Transport-Requests:581] 2014-06-18 11:05:20,229 Bat= chStatement.java (line 228) Batch of prepared statements for [identificatio= n1.entity, identification1.entity_lookup] is of size 22978, exceeding speci= fied threshold of 5120 by 17858.
=C2=A0INFO [MemoryMeter:1] 2014-06-18 11:05:32,682 Memtable.java (line= 481) CFS(Keyspace=3D'OpsCenter', ColumnFamily=3D'rollups300= 9;) liveRatio is 14.249755859375 (just-counted was 9.85302734375). =C2=A0ca= lculation took 3ms for 1024 cells

After some time, one node of the cluster goes down. The= n it goes back after some seconds and another node goes down. It keeps happ= ening and there is always a node down in the cluster, when it goes back ano= ther one falls.

The only exceptions I see in the log is "connected= reset by the peer", which seems to be relative to gossip protocol, wh= en a node goes down.

Any hint of what could I do t= o investigate this problem further?

Best regards,
Marcelo Valle.
--089e01538e6cb3143604fc1db5bc--