Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
MIME-Version: 1.0
In-Reply-To: <CAENxBwzws7+9t9rryh1j6icavLd37ffyOjH5uHMsTb6F=YrBfg@mail.gmail.com>
References: <F35D18B1-01D4-474B-BBBD-A75BCB717E40@nordsc.com>
 <CAEDUwd1Dt7c50xnZQ1pBbYBnz8q1TWg7LWXCozYB2T8UQjH3dw@mail.gmail.com>
 <CALY91SO6GdgW8G75=K_kKjuVSLBqEU6jDn9aKy-zFYTnWJtziQ@mail.gmail.com>
 <049FAE46-2AD8-4A9A-AA86-CE3BD09FC779@aol.com> <CALY91SOcvo-+1=WhAfMwAdhnrpB5SFtQW3tXUGdK63zGDOgSdQ@mail.gmail.com>
 <C8DE95CB-1D66-4DA8-849E-A7003082B81F@aol.com> <CALY91SOTNZAEhuCzvfVqKGBEAvSpduX_c7GzUyDO=NeCsXcV_g@mail.gmail.com>
 <CALY91SPCjw-PNzbS21FoxfU2fAaGE7MK0YHByvpr2RRQ5ExGAA@mail.gmail.com> <CAENxBwzws7+9t9rryh1j6icavLd37ffyOjH5uHMsTb6F=YrBfg@mail.gmail.com>
From: horschi <horschi@gmail.com>
Date: Sat, 24 Dec 2016 13:16:16 +0100
Message-ID: <CALY91SO5LeD16YJDhZ6Y2GomdwXHxpkw_EJ46_L24L06NiDkkg@mail.gmail.com>
Subject: Re: All subsequent CAS requests time out after heavy use of new CAS feature
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a11440fea2eb612054466789a
archived-at: Sat, 24 Dec 2016 12:16:31 -0000

--001a11440fea2eb612054466789a
Content-Type: text/plain; charset=UTF-8

Oh yes it is, like Counters :-)


On Sat, Dec 24, 2016 at 4:02 AM, Edward Capriolo <edlinuxguru@gmail.com>
wrote:

> Anecdotally, CAS works differently than the typical Cassandra workload. If
> you run a stress instance against 3 nodes on one host, you typically run
> into CPU issues, but with a CAS workload you see things timing out before
> you hit 100% CPU. It is a strange beast.
>
> On Fri, Dec 23, 2016 at 7:28 AM, horschi <horschi@gmail.com> wrote:
>
>> Update: I replaced all quorum reads on that table with serial reads, and
>> now these errors occur less often. Somehow quorum reads on CAS values
>> cause most of these WTEs (WriteTimeoutExceptions).
>>
>> Also I found two tickets on that topic:
>> https://issues.apache.org/jira/browse/CASSANDRA-9328
>> https://issues.apache.org/jira/browse/CASSANDRA-8672
>>
>> On Thu, Dec 15, 2016 at 3:14 PM, horschi <horschi@gmail.com> wrote:
>>
>>> Hi,
>>>
>>> I would like to warm up this old thread. I did some debugging and found
>>> out that the timeouts are coming from StorageProxy.proposePaxos()
>>> - callback.isFullyRefused() returns false and therefore triggers a
>>> WriteTimeout.
>>>
>>> Looking at my ccm cluster logs, I can see that two replica nodes return
>>> different results in their ProposeVerbHandler. In my opinion the
>>> coordinator should not throw an exception in such a case, but instead
>>> retry the operation.
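
The decision described above can be sketched as a small model. This is an illustration only, not Cassandra's actual code: the names ProposeOutcome and classify are made up, and the "fully refused" rule is a simplification of what callback.isFullyRefused() is described as doing. The numbers come from the log excerpt in this thread (node2 accepted, node1 and node3 rejected, 2 accepts required).

```python
from dataclasses import dataclass


@dataclass
class ProposeOutcome:
    """Tally of replica answers to one Paxos propose round (hypothetical model)."""
    accepts: int
    rejects: int
    required: int  # accepts needed for the proposal to succeed

    def is_successful(self) -> bool:
        return self.accepts >= self.required

    def is_fully_refused(self) -> bool:
        # "Fully refused" here means: replicas answered and none of them accepted.
        return self.accepts == 0 and self.rejects > 0


def classify(outcome: ProposeOutcome) -> str:
    """Map a propose round to what the coordinator would report in this model."""
    if outcome.is_successful():
        return "committed"
    if outcome.is_fully_refused():
        # No replica accepted this ballot, so starting over is safe.
        return "retry"
    # Some replicas accepted and some rejected: the final outcome is uncertain.
    return "timeout"


# Replica answers from the log excerpt: one accept, two rejects.
mixed = ProposeOutcome(accepts=1, rejects=2, required=2)
print(classify(mixed))  # prints "timeout"
```

In this model the mixed case is reported as a timeout because a minority of replicas already accepted the proposal, so a later Paxos round may still finish it. That may be why a plain retry inside the coordinator is not obviously safe.
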
>>>
>>> What do the CAS/Paxos experts on this list say to this? Feel free to
>>> instruct me to do further tests/code changes. I'd be glad to help.
>>>
>>> Log:
>>>
>>> node1/logs/system.log:WARN  [SharedPool-Worker-5] 2016-12-15
>>> 14:48:36,896 PaxosState.java:124 - Rejecting proposal for
>>> Commit(2d803540-c2cd-11e6-2e48-53a129c60cfc, [MDS.Lock] key=locktest_ 1
>>> columns=[[] | [value]]
>>> node1/logs/system.log-    Row: id=@ | value=<tombstone>) because
>>> inProgress is now Commit(2d8146b0-c2cd-11e6-f996-e5c8d88a1da4,
>>> [MDS.Lock] key=locktest_ 1 columns=[[] | [value]]
>>> --
>>> node1/logs/system.log:ERROR [SharedPool-Worker-12] 2016-12-15
>>> 14:48:36,980 StorageProxy.java:506 - proposePaxos:
>>> Commit(2d803540-c2cd-11e6-2e48-53a129c60cfc, [MDS.Lock] key=locktest_ 1
>>> columns=[[] | [value]]
>>> node1/logs/system.log-    Row: id=@ | value=<tombstone>)//1//0
>>> --
>>> node2/logs/system.log:WARN  [SharedPool-Worker-7] 2016-12-15
>>> 14:48:36,969 PaxosState.java:117 - Accepting proposal:
>>> Commit(2d803540-c2cd-11e6-2e48-53a129c60cfc, [MDS.Lock] key=locktest_ 1
>>> columns=[[] | [value]]
>>> node2/logs/system.log-    Row: id=@ | value=<tombstone>)
>>> --
>>> node3/logs/system.log:WARN  [SharedPool-Worker-2] 2016-12-15
>>> 14:48:36,897 PaxosState.java:124 - Rejecting proposal for
>>> Commit(2d803540-c2cd-11e6-2e48-53a129c60cfc, [MDS.Lock] key=locktest_ 1
>>> columns=[[] | [value]]
>>> node3/logs/system.log-    Row: id=@ | value=<tombstone>) because
>>> inProgress is now Commit(2d8146b0-c2cd-11e6-f996-e5c8d88a1da4,
>>> [MDS.Lock] key=locktest_ 1 columns=[[] | [value]]
>>>
>>>
>>> kind regards,
>>> Christian
>>>
>>>
>>> On Fri, Apr 15, 2016 at 8:27 PM, Denise Rogers <datagwal@aol.com> wrote:
>>>
>>>> My thinking was that, due to the size of the data, there may be I/O
>>>> issues. But it sounds more like you're competing for locks and hitting
>>>> a deadlock issue.
>>>>
>>>> Regards,
>>>> Denise
>>>> Cell - (860)989-3431
>>>>
>>>> Sent from mi iPhone
>>>>
>>>> On Apr 15, 2016, at 9:00 AM, horschi <horschi@gmail.com> wrote:
>>>>
>>>> Hi Denise,
>>>>
>>>> In my case it's a small blob I am writing (should be around 100 bytes):
>>>>
>>>>      CREATE TABLE "Lock" (
>>>>          lockname varchar,
>>>>          id varchar,
>>>>          value blob,
>>>>          PRIMARY KEY (lockname, id)
>>>>      ) WITH COMPACT STORAGE
>>>>          AND COMPRESSION = { 'sstable_compression' : 'SnappyCompressor', 'chunk_length_kb' : '8' };
>>>>
>>>> Are you asking because large values are known to cause issues? Is there
>>>> anything specific you have in mind?
>>>>
>>>> kind regards,
>>>> Christian
>>>>
>>>>
>>>>
>>>>
>>>> On Fri, Apr 15, 2016 at 2:42 PM, Denise Rogers <datagwal@aol.com>
>>>> wrote:
>>>>
>>>>> Also, what type of data were you reading/writing?
>>>>>
>>>>> Regards,
>>>>> Denise
>>>>>
>>>>> Sent from mi iPad
>>>>>
>>>>> On Apr 15, 2016, at 8:29 AM, horschi <horschi@gmail.com> wrote:
>>>>>
>>>>> Hi Jan,
>>>>>
>>>>> Were you able to resolve your problem?
>>>>>
>>>>> We are trying the same and also see a lot of WriteTimeouts:
>>>>> WriteTimeoutException: Cassandra timeout during write query at
>>>>> consistency SERIAL (2 replica were required but only 1 acknowledged the
>>>>> write)
>>>>>
>>>>> How many clients were competing for a lock in your case? In our case
>>>>> it's only two :-(
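
One client-side way to cope with these timeouts is to retry with backoff and to check the lock state with a serial read before retrying, since a timed-out CAS write may or may not have been applied. The sketch below is only an illustration under assumptions: CasTimeout, try_acquire and read_owner_serial are hypothetical stand-ins, not driver APIs, and the demo uses fake functions instead of a real cluster.

```python
import random
import time


class CasTimeout(Exception):
    """Stand-in for a CAS write timeout; the real driver exception differs."""


def acquire_with_retry(try_acquire, read_owner_serial, me,
                       attempts=5, base_delay=0.05, sleep=time.sleep):
    """Try to take a lock via CAS, resolving timeouts with a serial read.

    try_acquire():       hypothetical; returns True if the CAS applied,
                         False if it did not, raises CasTimeout if unknown.
    read_owner_serial(): hypothetical; returns the current lock owner,
                         read at SERIAL consistency.
    """
    for attempt in range(attempts):
        try:
            if try_acquire():
                return True
        except CasTimeout:
            # The write may or may not have been applied; check before retrying.
            if read_owner_serial() == me:
                return True
        # Randomised exponential backoff to reduce contention between clients.
        sleep(base_delay * (2 ** attempt) * random.random())
    return False


# Demo with fakes: the first attempt times out, the second one succeeds.
calls = {"n": 0}


def flaky_acquire():
    calls["n"] += 1
    if calls["n"] == 1:
        raise CasTimeout()
    return True


print(acquire_with_retry(flaky_acquire, lambda: None, me="client-a",
                         sleep=lambda seconds: None))  # prints True
```

The sleep function is passed in as a parameter so the demo runs without delays; a real client would keep the default.
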
>>>>>
>>>>> cheers,
>>>>> Christian
>>>>>
>>>>>
>>>>> On Tue, Sep 24, 2013 at 12:18 AM, Robert Coli <rcoli@eventbrite.com>
>>>>> wrote:
>>>>>
>>>>>> On Mon, Sep 16, 2013 at 9:09 AM, Jan Algermissen <
>>>>>> jan.algermissen@nordsc.com> wrote:
>>>>>>
>>>>>>> I am experimenting with C* 2.0 ( and today's java-driver 2.0
>>>>>>> snapshot) for implementing distributed locks.
>>>>>>>
>>>>>>
>>>>>> [ and I'm experiencing the problem described in the subject ... ]
>>>>>>
>>>>>>
>>>>>>> Any idea how to approach this problem?
>>>>>>>
>>>>>>
>>>>>> 1) Upgrade to 2.0.1 release.
>>>>>> 2) Try to reproduce symptoms.
>>>>>> 3) If able to, file a JIRA at
>>>>>> https://issues.apache.org/jira/secure/Dashboard.jspa including repro steps
>>>>>> 4) Reply to this thread with the JIRA ticket URL
>>>>>>
>>>>>> =Rob
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>

--001a11440fea2eb612054466789a--