From: Colin Clark
Date: Sat, 7 Jun 2014 12:41:38 -0500
Subject: Re: Data model for streaming a large table in real time.
To: user@cassandra.apache.org

It's an anti-pattern, and there are better ways to do this.

I have implemented the paging algorithm you've described using wide rows and bucketing (a sketch of that kind of table layout follows below). This approach makes far more efficient use of Cassandra's built-in strengths.

Also, I wouldn't let any number of clients, let alone a huge number, connect directly to the cluster to do this; put some type of app server in between to handle the connections and the fan-out. You'll get better resource utilization and less overhead, in addition to the flexibility of choosing which data center serves the requests.

--
Colin
320-221-9531
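To make the wide-row bucketing concrete, here is a minimal sketch of what such a table could look like in CQL. All names are hypothetical (the thread doesn't include a schema), and it assumes the cluster runs the default Murmur3Partitioner so bucket values hash evenly across the ring:

    -- Hypothetical bucketed wide-row layout; table and column names
    -- are illustrative, not from the thread.
    CREATE TABLE events (
        bucket  int,       -- e.g. writer_id % 256, assigned at write time
        ts      timeuuid,  -- time-ordered and unique per writer, avoiding collisions
        payload blob,
        PRIMARY KEY (bucket, ts)   -- bucket = partition key, ts = clustering column
    ) WITH CLUSTERING ORDER BY (ts ASC);

Within a partition, rows are stored in ts order, so each bucket behaves like one wide row that can be read sequentially. Across partitions, Murmur3Partitioner spreads the buckets over the ring, which is what removes the hotspot problem a ByteOrderedPartitioner layout would create.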
On Jun 7, 2014, at 12:28 PM, Kevin Burton <burton@spinn3r.com> wrote:

I just checked the source, and in 2.1.0 it's not deprecated. So it *might* be in the process of being deprecated, but I haven't seen anything stating that.

On Sat, Jun 7, 2014 at 8:03 AM, Colin <colpclark@gmail.com> wrote:
>
> I believe ByteOrderedPartitioner is being deprecated, and for good reason.
> I would look at what you could achieve by using wide rows and
> Murmur3Partitioner.
>
> --
> Colin
> 320-221-9531
>
> On Jun 6, 2014, at 5:27 PM, Kevin Burton <burton@spinn3r.com> wrote:
>
> We have the requirement to have clients read from our tables while they're
> being written.
>
> Basically, any write that we make to Cassandra needs to be sent out over
> the Internet to our customers.
>
> We also need them to be able to resume, so if they go offline they can just
> pick up where they left off.
>
> They need to do this in parallel, so if we have 20 Cassandra nodes, they
> can have 20 readers, each efficiently (and without coordination) reading
> from our tables.
>
> Here's how we're planning on doing it.
>
> We're going to use the ByteOrderedPartitioner.
>
> I'm writing with a primary key of the timestamp; however, in practice,
> this would yield hotspots.
>
> (I'm also aware that time isn't a very good pk in a distributed system, as I
> can easily have a collision, so we're going to use a scheme similar to a
> uuid to make it unique per writer.)
>
> One node would take all the load, followed by the next node, etc.
>
> So my plan to stop this is to prefix a slice ID to the timestamp. This
> way each piece of content has a unique ID, but the prefix will place it on
> a node.
>
> The slice ID is just a byte… so this means there are 255 buckets in which
> I can place data.
>
> This means I can have clients each start with a slice and a timestamp,
> and page through the data with tokens.
>
> This way I can have a client reading with 255 threads from 255 regions in
> the cluster, in parallel, without any hot spots.
>
> Thoughts on this strategy?
>
> --
>
> Founder/CEO Spinn3r.com
> Location: San Francisco, CA
> Skype: burtonator
> blog: http://burtonator.wordpress.com
> … or check out my Google+ profile
>
> War is peace. Freedom is slavery. Ignorance is strength. Corporations are
> people.

--
Founder/CEO Spinn3r.com
Location: San Francisco, CA
Skype: burtonator
blog: http://burtonator.wordpress.com
… or check out my Google+ profile

War is peace. Freedom is slavery. Ignorance is strength. Corporations are people.
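For the resume requirement Kevin describes, the bucketed layout gives a natural pattern without any token math: each reader owns a set of buckets and remembers the last timeuuid it processed in each one. A hedged sketch against the hypothetical table from earlier (bucket number and timeuuid literal are placeholders):

    -- Hypothetical resume query for one reader and one bucket.
    SELECT ts, payload
    FROM events
    WHERE bucket = 42                                   -- bucket this reader owns
      AND ts > 8a6f3e10-ee60-11e3-ac10-0800200c9a66     -- last timeuuid it saw
    LIMIT 1000;

On a first run, a reader can bound the range with minTimeuuid() or maxTimeuuid() of its chosen start time instead of a stored value; those are built-in CQL functions intended for exactly this kind of comparison.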
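Kevin's plan also mentions paging "through the data with tokens". That still works under Murmur3Partitioner: readers can split the ring's full token range among themselves and scan their slice, with no coordination beyond agreeing on the split. A sketch, again against the hypothetical table (Murmur3 tokens span -2^63 to 2^63 - 1; the upper bound below is the end of the first of 16 equal subranges):

    -- Hypothetical token-range scan: one reader's slice of the ring.
    SELECT bucket, ts, payload
    FROM events
    WHERE token(bucket) >= -9223372036854775808
      AND token(bucket) <  -8070450532247928832;

With only a few hundred distinct bucket values, the slices won't come out perfectly even, so in this particular design it is simpler for readers to just divide the buckets among themselves; the token() form matters more for tables with many partitions.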