Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_17C72B33-1B80-49ED-8DCF-28A5589545E8"
Message-Id: <2027FF56-EA9B-48B3-BDB7-82DF9857C00E@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: re-execution of failed queries with rpc_timeout
Date: Wed, 17 Apr 2013 09:08:13 +1200
References: 
 <CAEeoTeMsFZtkHPkaafyS1779MnW2+sNsjV+S-TO-2rSZhA3MUw@mail.gmail.com>
 <CAEeoTeNZw6uhUxJJJUJ5J81FrmNJvr0riPmdm8Cik+TsrtP7Rw@mail.gmail.com>
To: user@cassandra.apache.org
In-Reply-To: 
 <CAEeoTeNZw6uhUxJJJUJ5J81FrmNJvr0riPmdm8Cik+TsrtP7Rw@mail.gmail.com>


--Apple-Mail=_17C72B33-1B80-49ED-8DCF-28A5589545E8
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

If you are using Counters you need to do everything you can to avoid =
timeouts. In the worse case we do not know where it has been applied. =
The increment is applied on a lead and then replicated to the others, if =
the coordinator is not  the lead it may not know if the increments was =
applied at all.=20

Start by reducing the size of the updates. Larger batches do not always =
mean better performance.=20

>  In all other cases, the rpc_timeout might be thrown from a remote =
node (not the one I'm connected to), and hence some parts of the update =
will be performed and others parts will not.
TimedOutException is always thrown from the coordinator you are =
connected to.=20

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 15/04/2013, at 1:38 PM, Moty Kosharovsky <motykosh@gmail.com> wrote:

> Sorry, not LOCAL QUORUM, I meant "ANY" quorum.
>=20
>=20
> On Mon, Apr 15, 2013 at 4:12 AM, Moty Kosharovsky <motykosh@gmail.com> =
wrote:
> Hello,
>=20
> I'm running a 12 node cluser with cassandra 1.1.5 and oracle jdk =
1.6.0_35. Our application constantly writes large updates with cql. Once =
in a while, an rpc_time will occur.
>=20
> Since a lot of the information is counters, its impossible for me to =
understand if the updates complete partially on rpc_timeout, or =
cassandra somehow rolls back the change completely, and hence I can't =
tell if I should re-execute the query on rpc_timeout (with double =
processing being a bigger concern than missing updates).
>=20
> I am thinking, but unsure of this, that if I'll switch to =
LOCAL_QUORUM, rpc_timeout will always mean that the update was not =
processes as a whole. In all other cases, the rpc_timeout might be =
thrown from a remote node (not the one I'm connected to), and hence some =
parts of the update will be performed and others parts will not.
>=20
> Anyone solved this issue before?
>=20
> Kind Regards,
> Kosha
>=20


--Apple-Mail=_17C72B33-1B80-49ED-8DCF-28A5589545E8
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=iso-8859-1

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Diso-8859-1"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">If =
you are using Counters you need to do everything you can to avoid =
timeouts. In the worse case we do not know where it has been applied. =
The increment is applied on a lead and then replicated to the others, if =
the coordinator is not &nbsp;the lead it may not know if the increments =
was applied at all.&nbsp;<div><br></div><div>Start by reducing the size =
of the updates. Larger batches do not always mean better =
performance.&nbsp;</div><div><br></div><div><blockquote type=3D"cite"><div=
 class=3D"gmail_extra"><div class=3D"gmail_quote"><blockquote =
class=3D"gmail_quote" style=3D"margin: 0px 0px 0px 0.8ex; =
border-left-width: 1px; border-left-color: rgb(204, 204, 204); =
border-left-style: solid; padding-left: 1ex; position: static; z-index: =
auto; "><div dir=3D"ltr">&nbsp;In all other cases, the rpc_timeout might =
be thrown from a remote node (not the one I'm connected to), and hence =
some parts of the update will be performed and others parts will =
not.</div></blockquote></div></div></blockquote>TimedOutException is =
always thrown from the coordinator you are connected =
to.&nbsp;</div><div><br></div><div>Cheers</div><div><br><div =
apple-content-edited=3D"true">
<div style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: =
medium; font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div =
style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; =
font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; =
text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; =
border-spacing: 0px; -webkit-text-decorations-in-effect: none; =
-webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px; =
font-size: medium; "><div style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Cassandra Consultant</div><div>New =
Zealand</div><div><br></div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></span></div></span></div></span></div></span></div></div>
</div>

<br><div><div>On 15/04/2013, at 1:38 PM, Moty Kosharovsky &lt;<a =
href=3D"mailto:motykosh@gmail.com">motykosh@gmail.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite"><div dir=3D"ltr">Sorry, not LOCAL QUORUM, I meant "ANY" =
quorum.</div><div class=3D"gmail_extra"><br><br><div =
class=3D"gmail_quote">On Mon, Apr 15, 2013 at 4:12 AM, Moty Kosharovsky =
<span dir=3D"ltr">&lt;<a href=3D"mailto:motykosh@gmail.com" =
target=3D"_blank">motykosh@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px 0.8ex; =
border-left-width: 1px; border-left-color: rgb(204, 204, 204); =
border-left-style: solid; padding-left: 1ex; position: static; z-index: =
auto; "><div dir=3D"ltr">Hello,<div><br></div><div>I'm running a 12 node =
cluser with cassandra 1.1.5 and oracle jdk 1.6.0_35. Our application =
constantly writes large updates with cql. Once in a while, an rpc_time =
will&nbsp;occur.</div>

<div><br></div><div>Since a lot of the information is counters, its =
impossible for me to understand if the updates complete partially on =
rpc_timeout, or cassandra somehow rolls back the change completely, and =
hence I can't tell if I should re-execute the query on rpc_timeout (with =
double processing being a bigger concern than missing updates).</div>

<div><br></div><div>I am thinking, but unsure of this, that if I'll =
switch to LOCAL_QUORUM, rpc_timeout will always mean that the update was =
not processes as a whole. In all other cases, the rpc_timeout might be =
thrown from a remote node (not the one I'm connected to), and hence some =
parts of the update will be performed and others parts will not.</div>

<div><br></div><div>Anyone solved this issue =
before?</div><div><br></div><div>Kind =
Regards,</div><div>Kosha</div></div>
</blockquote></div><br></div>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_17C72B33-1B80-49ED-8DCF-28A5589545E8--