Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of nick.telford@tweetmeme.com
 designates 209.85.215.172 as permitted sender)
MIME-Version: 1.0
Sender: nick.telford@tweetmeme.com
In-Reply-To: <AANLkTinA=OHCaLU8VbXMxt7PzjJjGmsNd6FBhDtyaSTq@mail.gmail.com>
References: <AANLkTimb52PdRfJCJtyFq0ow2g8OM0VOj8=KL3GcbMaE@mail.gmail.com>
	<AANLkTimZEp_sJQmk6DdQCJ6Y8YAgsi7EXkVNk15w3Aoq@mail.gmail.com>
	<AANLkTinA=OHCaLU8VbXMxt7PzjJjGmsNd6FBhDtyaSTq@mail.gmail.com>
Date: Mon, 22 Nov 2010 13:03:31 +0000
Message-ID: <AANLkTikbYbxwV7YQBgY07hWg-mtoAYs92cNSo1Z4zWbc@mail.gmail.com>
Subject: Re: Facebook messaging and choice of HBase over Cassandra - what can
 we learn?
From: Nick Telford <nick.telford@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=0016e6db2f2d05dc100495a3e239

--0016e6db2f2d05dc100495a3e239
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Provided at least one node receives the write, it will eventually be writte=
n
to all replicas. A failure to meet the requested ConsistencyLevel is just
that; not a failure to write the data itself. Once the write is received by
a node, it will eventually reach all replicas, there is no roll back.

This is the source of a fair bit of confusion, as most people are used to
the binary behaviour of "success or failure". It's important that clients
are able to distinguish between a failure for a write to reach the cluster
and a failure to meet the requested ConsistencyLevel in order to provide
Durability guarantees for application data.

On 22 November 2010 12:31, David Boxenhorn <david@lookin2.com> wrote:

> Yes, but the value is supposed to be 11, since the write failed.
>
> On Mon, Nov 22, 2010 at 2:27 PM, Andr=E9 Fiedler <
> fiedler.andre@googlemail.com> wrote:
>
>> Doesn=B4t sync Cassandra all nodes if the network is up again? I think t=
his
>> was one of the reasons, storing a timestamp at every key/value pair?
>> So i think the response will only temporary be 11. If all nodes have syn=
ct
>> it should be 12? Or isn=B4t that so?
>>
>> greetings Andr=E9
>>
>> 2010/11/22 Samuel Carri=E8re <samuel.carriere@gmail.com>
>>
>> >Cassandra can work in a consistent way, see some of this discussion and
>>> the Consistency section here
>>> http://wiki.apache.org/cassandra/ArchitectureOverview
>>> >
>>> >If you always read and write with CL.Quorum (or the other way discusse=
d)
>>> you will have consistency. Even if some of the replicas are temporarily
>>> inconsistent, or off line or whatever. Your reads will >be consistent, =
i.e.
>>> every client will get the same value or the read will not work. If you =
want
>>> to work at a lower or higher consistency you can.
>>> >
>>> >Eventually all replicas of a value will become consistent.
>>> >
>>> >There are a number of reasons why cassandra may not be a good fit, and=
 I
>>> would guess something else would be a problem before the consistency mo=
del.
>>> >
>>> >Hope that helps.
>>> >Aaron
>>>
>>> Hello,
>>>
>>> I like cassandra a lot and I'm sure it can be used in many use cases,
>>> but I'm not sure we can say that we have strong consistency,
>>> even if we read and write with CL.Quorum.
>>>
>>> Firstly, we can only expect consistency at the column level. Reading
>>> and writing with CL.Quorum gives you most of the time
>>> a consistent value for each individual column, but it does not mean if
>>> gives you a consistent view of your data.
>>> (Because cassandra gives you no isolation and no transactions, your
>>> application has to deal with data inconsistencies).
>>>
>>> Secondly, I may be wrong, but I'm not sure consistency at the column
>>> level is guaranteed. Here is an example, with a replication
>>> factor of 3.
>>> Imagine that the current value of col1 is 11. Your application tries
>>> to write "col1 =3D 12" with CL.Quorum.
>>> Imagine the write arrives to node 1, but that the new value is not
>>> transmitted to nodes 2 and 3 because of network failures. So
>>> the write fails (this is the expected behaviour), but node 1 still has
>>> the new value (there is no rollback).
>>>
>>> Then, imagine that the network is back to normal, and that another
>>> client asked for the value of col1, with CL.Quorum. Here,
>>> the value of the response is not guaranteed. If the client asks for
>>> the value to node 2 and node 3, the response will be 11, but
>>> if he asks to node 1 and node 2 or 3, the response will be 12.
>>>
>>> Am I missing something ?
>>>
>>> Samuel
>>>
>>
>>
>

--0016e6db2f2d05dc100495a3e239
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Provided at least one node receives the write, it will eventually be writte=
n to all replicas. A failure to meet the requested ConsistencyLevel is just=
 that; not a failure to write the data itself. Once the write is received b=
y a node, it will eventually reach all replicas, there is no roll back.<div=
>
<br></div><div>This is the source of a fair bit of confusion, as most peopl=
e are used to the binary behaviour of &quot;success or failure&quot;. It=
9;s important that clients are able to distinguish between a failure for a =
write to reach the cluster and a failure to meet the requested ConsistencyL=
evel in order to provide Durability guarantees for application data.</div>
<div><br><div class=3D"gmail_quote">On 22 November 2010 12:31, David Boxenh=
orn <span dir=3D"ltr">&lt;<a href=3D"mailto:david@lookin2.com">david@lookin=
2.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"m=
argin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div dir=3D"ltr">Yes, but the value is supposed to be 11, since the write f=
ailed. <br><div><div></div><div class=3D"h5"><br><div class=3D"gmail_quote"=
>On Mon, Nov 22, 2010 at 2:27 PM, Andr=E9 Fiedler <span dir=3D"ltr">&lt;<a =
href=3D"mailto:fiedler.andre@googlemail.com" target=3D"_blank">fiedler.andr=
e@googlemail.com</a>&gt;</span> wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0pt 0pt 0pt 0.8ex;border-=
left:1px solid rgb(204, 204, 204);padding-left:1ex">Doesn=B4t sync Cassandr=
a all nodes if the network is up again? I think this was one of the reasons=
, storing a timestamp at every key/value pair?<div>

So i think the response will only temporary be 11. If all nodes have synct =
it should be 12? Or isn=B4t that so?</div>

<div><br></div><div>greetings Andr=E9<br><br><div class=3D"gmail_quote">201=
0/11/22 Samuel Carri=E8re <span dir=3D"ltr">&lt;<a href=3D"mailto:samuel.ca=
rriere@gmail.com" target=3D"_blank">samuel.carriere@gmail.com</a>&gt;</span=
><div><div>

</div><div><br><blockquote class=3D"gmail_quote" style=3D"margin:0pt 0pt 0p=
t 0.8ex;border-left:1px solid rgb(204, 204, 204);padding-left:1ex">

<div>&gt;Cassandra can work in a consistent way, see some of this discussio=
n and the Consistency section here <a href=3D"http://wiki.apache.org/cassan=
dra/ArchitectureOverview" target=3D"_blank">http://wiki.apache.org/cassandr=
a/ArchitectureOverview</a><br>


&gt;<br>
&gt;If you always read and write with CL.Quorum (or the other way discussed=
) you will have consistency. Even if some of the replicas are temporarily i=
nconsistent, or off line or whatever. Your reads will &gt;be consistent, i.=
e. every client will get the same value or the read will not work. If you w=
ant to work at a lower or higher consistency you can.<br>


&gt;<br>
&gt;Eventually all replicas of a value will become consistent.<br>
&gt;<br>
&gt;There are a number of reasons why cassandra may not be a good fit, and =
I would guess something else would be a problem before the consistency mode=
l.<br>
&gt;<br>
&gt;Hope that helps.<br>
&gt;Aaron<br>
<br>
</div>Hello,<br>
<br>
I like cassandra a lot and I&#39;m sure it can be used in many use cases,<b=
r>
but I&#39;m not sure we can say that we have strong consistency,<br>
even if we read and write with CL.Quorum.<br>
<br>
Firstly, we can only expect consistency at the column level. Reading<br>
and writing with CL.Quorum gives you most of the time<br>
a consistent value for each individual column, but it does not mean if<br>
gives you a consistent view of your data.<br>
(Because cassandra gives you no isolation and no transactions, your<br>
application has to deal with data inconsistencies).<br>
<br>
Secondly, I may be wrong, but I&#39;m not sure consistency at the column<br=
>
level is guaranteed. Here is an example, with a replication<br>
factor of 3.<br>
Imagine that the current value of col1 is 11. Your application tries<br>
to write &quot;col1 =3D 12&quot; with CL.Quorum.<br>
Imagine the write arrives to node 1, but that the new value is not<br>
transmitted to nodes 2 and 3 because of network failures. So<br>
the write fails (this is the expected behaviour), but node 1 still has<br>
the new value (there is no rollback).<br>
<br>
Then, imagine that the network is back to normal, and that another<br>
client asked for the value of col1, with CL.Quorum. Here,<br>
the value of the response is not guaranteed. If the client asks for<br>
the value to node 2 and node 3, the response will be 11, but<br>
if he asks to node 1 and node 2 or 3, the response will be 12.<br>
<br>
Am I missing something ?<br>
<font color=3D"#888888"><br>
Samuel<br>
</font></blockquote></div></div></div><br></div>
</blockquote></div><br></div></div></div>
</blockquote></div><br></div>

--0016e6db2f2d05dc100495a3e239--