Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <2e191f541003141529v6a194f3dif5060420178f6ff2@mail.gmail.com>
References: <bad22a01003140946s17bbde2eu642802b1900b4a12@mail.gmail.com>
	 <2e191f541003141529v6a194f3dif5060420178f6ff2@mail.gmail.com>
Date: Sun, 14 Mar 2010 19:37:42 -0400
Message-ID: <fd4250211003141637n595a2b6ap109cacd49d6a9e55@mail.gmail.com>
Subject: Re: serialized vector clock as global counter?
From: Dwight Merriman <dwight@10gen.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=00032555a7123892dd0481cb4001

--00032555a7123892dd0481cb4001
Content-Type: text/plain; charset=ISO-8859-1

Yes - take a look at this App Engine blog post:

http://googleappengine.blogspot.com/2009/09/migration-to-better-datastore.html

If I read this correctly, the App Engine datastore is pretty much in the
"strongly consistent" camp, while Cassandra is eventually consistent, so the
two are really quite different. You would get higher availability on an
eventually consistent (EC) system, but atomic updates become quite hard (at
least when fully generalized).
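
To make the race in the quoted question concrete, here is a minimal sketch. It
assumes an in-memory dict standing in for the column family, and it models
only the ordering of operations, not Cassandra's replication or quorum
mechanics. The function names (increment_own, read_sum, and so on) are
hypothetical, not part of any Cassandra client.

```python
from typing import Dict, Tuple


def increment_own(rows: Dict[str, int], client: str) -> None:
    """The client bumps its own counter row (stands in for the quorum write)."""
    rows[client] = rows.get(client, 0) + 1


def read_sum(rows: Dict[str, int]) -> int:
    """Read every row and sum the values (stands in for the quorum read)."""
    return sum(rows.values())


def racy_interleaving() -> Tuple[int, int]:
    """Both writes land before either client reads."""
    rows: Dict[str, int] = {}
    increment_own(rows, "A")  # A updates its counter
    increment_own(rows, "B")  # B updates its counter
    x_a = read_sum(rows)      # A reads the keys to get sum X
    x_b = read_sum(rows)      # B reads the keys to get the same sum X
    return x_a, x_b


def serial_interleaving() -> Tuple[int, int]:
    """Each client finishes its write and read before the other starts."""
    rows: Dict[str, int] = {}
    increment_own(rows, "A")
    x_a = read_sum(rows)
    increment_own(rows, "B")
    x_b = read_sum(rows)
    return x_a, x_b
```

In this model the racy ordering hands both clients the same value, so the
sums never go backwards but they are not guaranteed to be distinct. Nothing
ties one client's write to its own read, which is the atomicity that is hard
to get on an EC system.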

On Sun, Mar 14, 2010 at 6:29 PM, Fred Wulff <frew@stanford.edu> wrote:

> Hey Toby,
>
> I'm not an expert on Cassandra's infrastructure, but I believe the
> thing the AppEngine datastore has that Cassandra doesn't is a
> transaction between the read and write of a sharded counter. That
> means that while the read of the various counters may be inconsistent,
> the actual update of the shard is always consistent and the read of
> that shard is always consistent with the previous write.
>
> -Fred
>
> On Sun, Mar 14, 2010 at 9:46 AM, Toby DiPasquale <toby@cbcg.net> wrote:
> > Hi all,
> >
> > I'm trying to write an application using Cassandra which requires the
> > use of a global, monotonically-increasing counter. I've seen the
> > previous threads on this subject which basically say that this can't
> > be done in Cassandra as is, but I think I've come up with a method
> > that might work. I wanted to get the list's feedback on whether or not
> > this method is workable:
> >
> > * Each client maintains its own monotonically-increasing counter as a
> > row in Cassandra
> > * When a client wants to increment the counter, it will:
> >  * increment its own counter key using a quorum write
> >  * read all keys in the CF using a quorum read
> >  * the sum of the values is then the value of the counter
> >
> > This method is robust against nodes coming and going (new nodes just
> > get a new counter and dead nodes never increase their counter again).
> > It also doesn't matter for my application if some possible values for
> > the counter are skipped over, as long as every value is greater than
> > the last. I believe this scheme to be commensurate to a vector clock,
> > no?
> >
> > My question would be: assuming we're using both quorum reads and
> > writes, is it possible that clients A and B could race in the
> > following manner:
> >
> > * A updates its counter
> > * B updates its counter
> > * A reads the keys to get sum X
> > * B reads the keys to get the same sum X
> >
> > ...thus violating the ever-increasing constraint?
> >
> > Google App Engine suggests a similar method for doing global counters
> > on Datastore: http://code.google.com/appengine/articles/sharding_counters.html.
> > I'm troubled by their implementation, though, because the reads on the
> > list of counters are not transactional and are potentially subject to
> > the same race that I've described above.
> >
> > Any thoughts/ideas?
> >
> > --
> > Toby DiPasquale
> >
>
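
Fred's point in the quoted thread, that the read-and-write of a single shard
is transactional while the read across shards is not, can be sketched roughly
as below. This is an illustration under stated assumptions: a
threading.Lock per shard stands in for the Datastore transaction, and the
ShardedCounter class and run_workers helper are hypothetical names, not App
Engine or Cassandra APIs.

```python
import threading
from typing import Dict, List


class ShardedCounter:
    """In-memory model: one value and one lock per shard."""

    def __init__(self, shard_names: List[str]) -> None:
        self._values: Dict[str, int] = {name: 0 for name in shard_names}
        self._locks: Dict[str, threading.Lock] = {
            name: threading.Lock() for name in shard_names
        }

    def increment(self, shard: str) -> int:
        # The lock stands in for the transaction around the read and the
        # write of this one shard, so no update to the shard is lost.
        with self._locks[shard]:
            current = self._values[shard]
            self._values[shard] = current + 1
            return self._values[shard]

    def total(self) -> int:
        # No lock spans the shards: other shards may change while we sum,
        # so concurrent callers can observe the same total.
        return sum(self._values[name] for name in list(self._values))


def run_workers(counter: ShardedCounter, shard: str,
                workers: int, increments_each: int) -> None:
    """Hammer one shard from several threads and wait for them to finish."""

    def work() -> None:
        for _ in range(increments_each):
            counter.increment(shard)

    threads = [threading.Thread(target=work) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
```

Once all workers have finished, total() equals the number of increments
issued, because each shard update is atomic. While workers are still running,
total() is only a snapshot with no ordering guarantee relative to other
readers.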


--00032555a7123892dd0481cb4001--