Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_BBCECFB5-E7D9-4CC0-9EC8-673EEBE97074"
Message-Id: <4080F9F6-14EC-4A3C-BC15-B5C55DA54E84@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
Subject: Re: what happen if coordinator node fails during write
Date: Fri, 28 Jun 2013 16:51:57 +1200
References: 
 <CAADOH5aAj_Qu4A=9uA=PhtQfChHBzG0ANgBGuBYwY6F8fe93PA@mail.gmail.com>
 <CAK0tFt7MBiaUbRRJ=J+nprgSPYpz-YEAv2Zng26UkAPi_qcTBw@mail.gmail.com>
To: user@cassandra.apache.org
In-Reply-To: 
 <CAK0tFt7MBiaUbRRJ=J+nprgSPYpz-YEAv2Zng26UkAPi_qcTBw@mail.gmail.com>


--Apple-Mail=_BBCECFB5-E7D9-4CC0-9EC8-673EEBE97074
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

> As far as I know in 1.2 coordinator logs request before it updates =
replicas.
You may be thinking about atomic batches, which are enabled by default =
for 1.2 via CQL but must be supported by Thrift clients. I would guess =
Hector is not using them.=20
These logs are stored on other machines, which then reply the mutation =
if they have not been removed by a certain time.=20

>=20
> I am writing data to Cassandra by thrift client (not hector) and
> wonder what happen if the coordinator node fails.

How and when it fails is important.
But lets say their was an OS level OOM situation and the process was =
killed just after it sent messages to the remote replicas. In that case =
all you know if the request was applied on 0 to RF number of replicas. =
So it's the same as a TimedOutException.=20

The request did not complete at the request CL so reads to that data =
will be working eventual consistency until the next successful write.=20

Cheers


-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 26/06/2013, at 12:45 PM, Andrey Ilinykh <ailinykh@gmail.com> wrote:

> It depends on cassandra version. As far as I know in 1.2 coordinator =
logs request before it updates replicas. If it fails it will replay log =
on startup.
> In 1.1 you may have inconsistant state, because only part of your =
request is propagated to replicas.
>=20
> Thank you,
>   Andrey
>=20
>=20
> On Tue, Jun 25, 2013 at 5:11 PM, Jiaan Zeng <jiaan@bloomreach.com> =
wrote:
> Hi there,
>=20
> I am writing data to Cassandra by thrift client (not hector) and
> wonder what happen if the coordinator node fails. The same question
> applies for bulk loader which uses gossip protocol instead of thrift
> protocol. In my understanding, the HintedHandoff only takes care of
> the replica node fails.
>=20
> Thanks.
>=20
> --
> Regards,
> Jiaan
>=20


--Apple-Mail=_BBCECFB5-E7D9-4CC0-9EC8-673EEBE97074
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=iso-8859-1

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Diso-8859-1"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
"><blockquote type=3D"cite"><div dir=3D"ltr">As far as I know in 1.2 =
coordinator logs request before it updates =
replicas.</div></blockquote>You may be thinking about atomic batches, =
which are enabled by default for 1.2 via CQL but must be supported by =
Thrift clients. I would guess Hector is not using them.&nbsp;<div>These =
logs are stored on other machines, which then reply the mutation if they =
have not been removed by a certain =
time.&nbsp;</div><div><br></div><div><blockquote type=3D"cite"><div =
class=3D"gmail_extra"><div class=3D"gmail_quote"><blockquote =
class=3D"gmail_quote" style=3D"margin: 0px 0px 0px 0.8ex; =
border-left-width: 1px; border-left-color: rgb(204, 204, 204); =
border-left-style: solid; padding-left: 1ex; position: static; z-index: =
auto; "><br>I am writing data to Cassandra by thrift client (not hector) =
and<br>wonder what happen if the coordinator node =
fails.</blockquote></div></div></blockquote></div><div>How and when it =
fails is important.</div><div>But lets say their was an OS level OOM =
situation and the process was killed just after it sent messages to the =
remote replicas. In that case all you know if the request was applied on =
0 to RF number of replicas. So it's the same as a =
TimedOutException.&nbsp;</div><div><br></div><div>The request did not =
complete at the request CL so reads to that data will be working =
eventual consistency until the next successful =
write.&nbsp;</div><div><br></div><div>Cheers</div><div><br></div><div><br>=
<div apple-content-edited=3D"true">
<div style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: =
medium; font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div =
style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; =
font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; =
border-spacing: 0px; "><div style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Cassandra Consultant</div><div>New =
Zealand</div><div><br></div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></span></div></span></div></span></div></span></div></div>
</div>
<br><div><div>On 26/06/2013, at 12:45 PM, Andrey Ilinykh &lt;<a =
href=3D"mailto:ailinykh@gmail.com">ailinykh@gmail.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite"><div dir=3D"ltr">It depends on cassandra version. As far =
as I know in 1.2 coordinator logs request before it updates replicas. If =
it fails it will replay log on startup.<div style=3D"">In 1.1 you may =
have inconsistant state, because only part of your request =
is&nbsp;propagated&nbsp;to replicas.</div>
<div style=3D""><br></div><div style=3D"">Thank you,</div><div =
style=3D"">&nbsp; Andrey</div></div><div =
class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Tue, Jun 25, =
2013 at 5:11 PM, Jiaan Zeng <span dir=3D"ltr">&lt;<a =
href=3D"mailto:jiaan@bloomreach.com" =
target=3D"_blank">jiaan@bloomreach.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px 0.8ex; =
border-left-width: 1px; border-left-color: rgb(204, 204, 204); =
border-left-style: solid; padding-left: 1ex; position: static; z-index: =
auto; ">Hi there,<br>
<br>
I am writing data to Cassandra by thrift client (not hector) and<br>
wonder what happen if the coordinator node fails. The same question<br>
applies for bulk loader which uses gossip protocol instead of thrift<br>
protocol. In my understanding, the HintedHandoff only takes care of<br>
the replica node fails.<br>
<br>
Thanks.<br>
<br>
--<br>
Regards,<br>
Jiaan<br>
</blockquote></div><br></div>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_BBCECFB5-E7D9-4CC0-9EC8-673EEBE97074--