Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: local policy includes SPF record at
 spf.trusted-forwarder.org)
MIME-Version: 1.0
In-Reply-To: 
 <CAAX2xq4z_GCubyyGYOa3L9YfDZOmmfSBhG2jXUs8PnmqtNK9Sw@mail.gmail.com>
References: 
 <CAAX2xq54D2AmnqX56q5-A93Pobui+sQq5LoD24m5ahRsPhQVcQ@mail.gmail.com>
	<CAABB5w_M33AwZwPaFSeOGSt-BUb6MO2tOuwOamT5oyv5Dq8oMg@mail.gmail.com>
	<CAAX2xq4z_GCubyyGYOa3L9YfDZOmmfSBhG2jXUs8PnmqtNK9Sw@mail.gmail.com>
Date: Fri, 20 Jun 2014 07:50:46 -0400
Message-ID: 
 <CAABB5w8qJamaLu9BVA-4y+HS8xxa9m8c6HiBjdwps7-bYArd3w@mail.gmail.com>
Subject: Re: Batch of prepared statements exceeding specified threshold
From: Pavel Kogan <pavel.kogan@cortica.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7bf10a1c9bc45f04fc431af8

--047d7bf10a1c9bc45f04fc431af8
Content-Type: text/plain; charset=UTF-8

The cluster is new, so no updates were done. Version 2.0.8.
It happened when I did many writes (no reads). Writes are done in small
batches of 2 inserts (writing to 2 column families). The values are big
blobs (up to 100Kb).

Any clues?

Pavel


On Thu, Jun 19, 2014 at 8:07 PM, Marcelo Elias Del Valle <
marcelo@s1mbi0se.com.br> wrote:

> Pavel,
>
> Out of curiosity, did it start to happen before some update? Which version
> of Cassandra are you using?
>
> []s
>
>
> 2014-06-19 16:10 GMT-03:00 Pavel Kogan <pavel.kogan@cortica.com>:
>
>> What a coincidence! Today happened in my cluster of 7 nodes as well.
>>
>> Regards,
>>   Pavel
>>
>>
>> On Wed, Jun 18, 2014 at 11:13 AM, Marcelo Elias Del Valle <
>> marcelo@s1mbi0se.com.br> wrote:
>>
>>> I have a 10 node cluster with cassandra 2.0.8.
>>>
>>> I am taking this exceptions in the log when I run my code. What my code
>>> does is just reading data from a CF and in some cases it writes new data.
>>>
>>>  WARN [Native-Transport-Requests:553] 2014-06-18 11:04:51,391
>>> BatchStatement.java (line 228) Batch of prepared statements for
>>> [identification1.entity, identification1.entity_lookup] is of size 6165,
>>> exceeding specified threshold of 5120 by 1045.
>>>  WARN [Native-Transport-Requests:583] 2014-06-18 11:05:01,152
>>> BatchStatement.java (line 228) Batch of prepared statements for
>>> [identification1.entity, identification1.entity_lookup] is of size 21266,
>>> exceeding specified threshold of 5120 by 16146.
>>>  WARN [Native-Transport-Requests:581] 2014-06-18 11:05:20,229
>>> BatchStatement.java (line 228) Batch of prepared statements for
>>> [identification1.entity, identification1.entity_lookup] is of size 22978,
>>> exceeding specified threshold of 5120 by 17858.
>>>  INFO [MemoryMeter:1] 2014-06-18 11:05:32,682 Memtable.java (line 481)
>>> CFS(Keyspace='OpsCenter', ColumnFamily='rollups300') liveRatio is
>>> 14.249755859375 (just-counted was 9.85302734375).  calculation took 3ms for
>>> 1024 cells
>>>
>>> After some time, one node of the cluster goes down. Then it goes back
>>> after some seconds and another node goes down. It keeps happening and there
>>> is always a node down in the cluster, when it goes back another one falls.
>>>
>>> The only exceptions I see in the log is "connected reset by the peer",
>>> which seems to be relative to gossip protocol, when a node goes down.
>>>
>>> Any hint of what could I do to investigate this problem further?
>>>
>>> Best regards,
>>> Marcelo Valle.
>>>
>>
>>
>

--047d7bf10a1c9bc45f04fc431af8
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">The cluster is new, so no updates were done. Version 2.0.8=
.<div>It happened when I did many writes (no reads). Writes are done in sma=
ll batches of 2 inserts (writing to 2 column families). The values are big =
blobs (up to 100Kb).</div>
<div><br></div><div>Any clues?</div><div><br></div><div>Pavel</div></div><d=
iv class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Thu, Jun 19,=
 2014 at 8:07 PM, Marcelo Elias Del Valle <span dir=3D"ltr">&lt;<a href=3D"=
mailto:marcelo@s1mbi0se.com.br" target=3D"_blank">marcelo@s1mbi0se.com.br</=
a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">Pavel,=C2=A0<div><br></div>=
<div>Out of curiosity, did it start to happen before some update? Which ver=
sion of Cassandra are you using?</div>
<div><br></div><div>[]s</div></div><div class=3D"gmail_extra"><br><br><div =
class=3D"gmail_quote"><div class=3D"">
2014-06-19 16:10 GMT-03:00 Pavel Kogan <span dir=3D"ltr">&lt;<a href=3D"mai=
lto:pavel.kogan@cortica.com" target=3D"_blank">pavel.kogan@cortica.com</a>&=
gt;</span>:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 =
0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<div dir=3D"ltr">What a coincidence! Today happened in my cluster of 7 node=
s as well.<div><br></div><div>Regards,</div><div>=C2=A0 Pavel</div></div><d=
iv><div class=3D"h5"><div><div><div class=3D"gmail_extra"><br><br><div clas=
s=3D"gmail_quote">

On Wed, Jun 18, 2014 at 11:13 AM, Marcelo Elias Del Valle <span dir=3D"ltr"=
>&lt;<a href=3D"mailto:marcelo@s1mbi0se.com.br" target=3D"_blank">marcelo@s=
1mbi0se.com.br</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>I have a 10 node clust=
er with cassandra 2.0.8.</div><div><br></div><div>I am taking this exceptio=
ns in the log when I run my code. What my code does is just reading data fr=
om a CF and in some cases it writes new data.</div>


<div><br></div><div>=C2=A0WARN [Native-Transport-Requests:553] 2014-06-18 1=
1:04:51,391 BatchStatement.java (line 228) Batch of prepared statements for=
 [identification1.entity, identification1.entity_lookup] is of size 6165, e=
xceeding specified threshold of 5120 by 1045.</div>


<div>=C2=A0WARN [Native-Transport-Requests:583] 2014-06-18 11:05:01,152 Bat=
chStatement.java (line 228) Batch of prepared statements for [identificatio=
n1.entity, identification1.entity_lookup] is of size 21266, exceeding speci=
fied threshold of 5120 by 16146.</div>


<div>=C2=A0WARN [Native-Transport-Requests:581] 2014-06-18 11:05:20,229 Bat=
chStatement.java (line 228) Batch of prepared statements for [identificatio=
n1.entity, identification1.entity_lookup] is of size 22978, exceeding speci=
fied threshold of 5120 by 17858.</div>


<div>=C2=A0INFO [MemoryMeter:1] 2014-06-18 11:05:32,682 Memtable.java (line=
 481) CFS(Keyspace=3D&#39;OpsCenter&#39;, ColumnFamily=3D&#39;rollups300=
9;) liveRatio is 14.249755859375 (just-counted was 9.85302734375). =C2=A0ca=
lculation took 3ms for 1024 cells</div>


<div><br></div><div>After some time, one node of the cluster goes down. The=
n it goes back after some seconds and another node goes down. It keeps happ=
ening and there is always a node down in the cluster, when it goes back ano=
ther one falls.</div>


<div><br></div><div>The only exceptions I see in the log is &quot;connected=
 reset by the peer&quot;, which seems to be relative to gossip protocol, wh=
en a node goes down.</div><div><br></div><div>Any hint of what could I do t=
o investigate this problem further?</div>


<div><br></div><div>Best regards,</div><div>Marcelo Valle.</div></div>
</blockquote></div><br></div>
</div></div></div></div></blockquote></div><br></div>
</blockquote></div><br></div>

--047d7bf10a1c9bc45f04fc431af8--