Date: Tue, 30 Jul 2013 15:39:48 +0400
Subject: Re: AssertionError: Incorrect row data size
From: Pavel Kirienko
To: user@cassandra.apache.org

Also, it is probably worth mentioning:
1. I see no other errors in the logs except that one.
2. Sometimes connected clients receive "Request did not complete within
rpc_timeout.", even when they are accessing other tables.
3. Sometimes, some cells from other tables may read as NULL when they are
in fact not empty. (This is really bad; maybe I failed to configure
something properly.)

I can provide more info if needed.

Thanks in advance.

On Tue, Jul 30, 2013 at 3:32 PM, Pavel Kirienko <
pavel.kirienko.list@gmail.com> wrote:

> Cassandra 1.2.8 still has this issue.
>
> Possible recipe to reproduce: create the table as described in the first
> message of this thread; write 3000 rows of 10MB each at a rate of about
> 0.1..1 requests per second.
> Maybe this behavior is caused by incremental compaction of large rows...
>
>
> On Mon, Jul 29, 2013 at 8:59 AM, Paul Ingalls wrote:
>
>> Great. Let me know what you find!
>>
>> Thanks!
>>
>> Paul
>>
>> Sent from my iPhone
>>
>> On Jul 27, 2013, at 2:47 AM, Pavel Kirienko <
>> pavel.kirienko.list@gmail.com> wrote:
>>
>> Hi Paul,
>>
>> I checked out your issue; it looks the same indeed. This can probably be
>> reproduced simply by writing large rows (> 10MB) at high rates.
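[Editor's note] The reproduction recipe quoted above (about 3000 rows of ~10MB each, written at roughly 0.1..1 requests per second) can be sketched as a throttled write loop. This is a minimal illustration only: `execute_write` is a hypothetical callback standing in for whatever driver call performs the actual INSERT, and no live cluster is contacted here.

```python
import os
import time

def generate_row(appid, staged, field, blob_size=10 * 1024 * 1024):
    """Build one row shaped like the 'outputs' table: a large random blob."""
    return {"appid": appid, "staged": staged, "field": field,
            "data": os.urandom(blob_size)}

def write_rows(execute_write, n_rows=3000, rate_hz=0.5,
               blob_size=10 * 1024 * 1024):
    """Feed n_rows large rows to execute_write, throttled to rate_hz req/s.

    execute_write is a hypothetical stand-in for the real driver call
    (e.g. executing a prepared INSERT against the 'outputs' table).
    """
    interval = 1.0 / rate_hz
    for i in range(n_rows):
        started = time.monotonic()
        execute_write(generate_row("app-%d" % (i % 10), True,
                                   "field-%d" % i, blob_size))
        # Sleep off the remainder of the per-request interval, if any,
        # so the sustained rate stays near rate_hz.
        elapsed = time.monotonic() - started
        if elapsed < interval:
            time.sleep(interval - elapsed)
```

With the defaults this would run for roughly 1.5 hours, matching the "huge write loads for a few hours" pattern described later in the thread.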
>> I'm going to try 1.2.7, then I will be back with results.
>>
>>
>> On Sat, Jul 27, 2013 at 12:18 AM, Paul Ingalls wrote:
>>
>>> This is the same issue we have been seeing. Still no luck getting a
>>> simple repro case for creating a JIRA issue. Do you have something
>>> simple enough to drop in a JIRA report?
>>>
>>> Paul
>>>
>>> On Jul 26, 2013, at 8:06 AM, Pavel Kirienko <
>>> pavel.kirienko.list@gmail.com> wrote:
>>>
>>> > Hi list,
>>> >
>>> > We run Cassandra 1.2 on a three-node cluster. Each node has 16GB RAM
>>> > and a single 200GB HDD, running Ubuntu Server 12.04.
>>> >
>>> > There is an issue with one table that contains about 3000 rows; here
>>> > is its describe-table output:
>>> >
>>> > CREATE TABLE outputs (
>>> >   appid text,
>>> >   staged boolean,
>>> >   field ascii,
>>> >   data blob,
>>> >   PRIMARY KEY (appid, staged, field)
>>> > ) WITH
>>> >   bloom_filter_fp_chance=0.010000 AND
>>> >   caching='KEYS_ONLY' AND
>>> >   comment='' AND
>>> >   dclocal_read_repair_chance=0.000000 AND
>>> >   gc_grace_seconds=864000 AND
>>> >   read_repair_chance=0.100000 AND
>>> >   replicate_on_write='true' AND
>>> >   populate_io_cache_on_flush='false' AND
>>> >   compaction={'class': 'SizeTieredCompactionStrategy'} AND
>>> >   compression={'sstable_compression': 'SnappyCompressor'};
>>> >
>>> > The DATA column contains blobs of about 1..50MB each; the average
>>> > size should be around 5MB.
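[Editor's note] For concreteness, a single row of this table would be written with a statement shaped like the following. This is a sketch only: the keyspace name `woodpecker` is an assumption inferred from the SSTable path that appears in the error message, and the helper merely builds the CQL text; it does not talk to a cluster.

```python
def outputs_insert_cql(keyspace="woodpecker"):
    """Build the CQL INSERT for one row of the 'outputs' table.

    The keyspace name is an assumption, inferred from the SSTable path
    in the compaction error (/var/lib/cassandra/data/woodpecker/...).
    """
    return (
        "INSERT INTO {}.outputs (appid, staged, field, data) "
        "VALUES (?, ?, ?, ?)".format(keyspace)
    )

# The four bind markers line up with the full primary key
# (appid, staged, field) plus the large 'data' blob.
print(outputs_insert_cql())
```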
>>> >
>>> > Sometimes this table experiences huge write loads for a few hours;
>>> > at such times I see suspicious things in the logs:
>>> >
>>> > ERROR [CompactionExecutor:357] 2013-07-24 12:32:10,293
>>> CassandraDaemon.java (line 192) Exception in thread
>>> Thread[CompactionExecutor:357,1,main]
>>> > java.lang.AssertionError: incorrect row data size 172489604 written to
>>> /var/lib/cassandra/data/woodpecker/outputs/woodpecker-outputs-tmp-ic-813-Data.db;
>>> correct is 172489704
>>> >         at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:162)
>>> >         at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:162)
>>> >         at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
>>> >         at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
>>> >         at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:58)
>>> >         at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:60)
>>> >         at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:211)
>>> >         at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
>>> >         at java.util.concurrent.FutureTask$Sync.innerRun(Unknown Source)
>>> >         at java.util.concurrent.FutureTask.run(Unknown Source)
>>> >         at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
>>> >         at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>>> >         at java.lang.Thread.run(Unknown Source)
>>> >
>>> > What shall I do about this?
>>> >
>>> > Thanks in advance.
>>> > Pavel.
>>> >
>>>
>>>
>>
>
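[Editor's note] One possibly useful observation for anyone triaging this: the written and expected sizes in the assertion message differ by exactly 100 bytes, which may hint at a small fixed accounting discrepancy rather than a wholesale size mismatch. The arithmetic, with the values copied verbatim from the log line above:

```python
# Sizes taken verbatim from the AssertionError in the log above.
written_size = 172489604   # bytes actually written to the -tmp- SSTable
expected_size = 172489704  # size the writer expected ("correct is ...")

delta = expected_size - written_size
print(delta)  # prints 100
```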