Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <AANLkTimCCKsSeiF+j95X3Nn=tYz2C4yPx2BK2PH65DEy@mail.gmail.com>
References: <AANLkTikaMAibtgg8MSCReuGWn8KD4jj31QO5mwPZh0sC@mail.gmail.com>
	<AANLkTi=87LDxOpt-h2WMnUxXbsXis9tj53jMkoe2FRxw@mail.gmail.com>
	<AANLkTimyyoqdj1+Sp1AMgLimx=+d_Z0r61Hr4mQf9qRP@mail.gmail.com>
	<AANLkTimCCKsSeiF+j95X3Nn=tYz2C4yPx2BK2PH65DEy@mail.gmail.com>
Date: Fri, 15 Oct 2010 18:56:22 -0500
Message-ID: <AANLkTi=G6mXhmxms7R6MvzFmkwfehvVSaPbapYFYEOyc@mail.gmail.com>
Subject: Re: Recommended sort mechanism and partitioner
From: Tyler Hobbs <tyler@riptano.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=0016e65aeb0adb92ad0492b092d3

--0016e65aeb0adb92ad0492b092d3
Content-Type: text/plain; charset=ISO-8859-1

i) Yes

ii) Well, so you don't actually want to use version 1 UUIDs for keys here.
Although
they mostly increase in byte order over time, it's only for the first 8
bytes.  Instead,
you can use something like:

'timestamp-foo'

Where 'foo' might be a randomly generated string or something unique per
client.

You could also use 'YYYYMMDDSSmmm' instead of the timestamp if that makes
queries easier for you.

- Tyler

On Fri, Oct 15, 2010 at 6:22 PM, Wicked J <wickedj2010@gmail.com> wrote:

> Tyler,
> Thanks for answering my question. Can you please clarify on point (c)?
>
> i] Are you saying that if I move to second row (identified by a rowKey in
> Cassandra) after I hit 10 million  col. values for 1st row, only then the
> second row will be written to a new node in the cluster?  meaning all the 10
> million column values within the first row (rowKey) until then have been
> written to one and the same node regardless of the # of nodes in the
> cluster.
>
> ii] Assume I change my data model to the one in below (CF1) with a
> "OrderPreservingPartitioner" then would I be able to read data in the order
> inserted? Because my understanding is TimeUUID values cannot be inserted for
> row Keys based on the Thrift API in v0.6.4 i.e. from the insert method in
> Cassandra.Client or am I missing something?
>
> CF1:
>
> Key: '1'
>   name: colname, value: 'First Inserted', timestamp: 1287165326492
> Key: '2'
>   name: colname, value: 'Second Inserted', timestamp: 1287165326523
>
> Thanks!
>
>
> On Fri, Oct 15, 2010 at 12:18 PM, Tyler Hobbs <tyler@riptano.com> wrote:
>
>> a) 10 mil sounds fine.  Just watch out for compaction. Huge rows can kill
>> you there,
>> from my understanding.
>>
>> b) Use RandomPartitioner unless you absolutely have to use something else.
>>
>> c) If you're inserting all along one row and only moving to another row
>> when you
>> hit 10 mil, you're only going to be writing to one node at a time.  In
>> this sense,
>> you might want to consider using the TimeUUID as a row key instead.
>> There's
>> not really a problem with having tons of rows in a column family.
>>
>> If you want to be able to get a slice of time with this scheme, you can
>> either use
>> an order preserving partitioner or have a second column family with an
>> index
>> row (or rows) sorted by TimeUUID. (This sounds like what you're
>> suggesting.)
>>
>> - Tyler
>>
>>
>> I wrote some thoughts about this on my blog. I think it's still mostly
>>> correct:
>>>
>>>  * http://www.ayogo.com/techblog/2010/04/sorting-in-cassandra/
>>>
>>> On Fri, Oct 15, 2010 at 11:14 AM, Wicked J <wickedj2010@gmail.com>
>>> wrote:
>>> > Hi,
>>> > I'm using TimeUUID/Sort by column name mechanism. The column value can
>>> > contain text data (in future they may contain image data as well)
>>> leading to
>>> > the possibility of a row out-growing the RAM capacity. Given this
>>> background
>>> > my questions are:
>>> >
>>> > a] How many columns are recommended against one row? Based on my app.
>>> needs,
>>> > I can imagine having 10 million would be a good starting point for the
>>> > max_limit (based on text data). Also note that my app. will use search
>>> in
>>> > ranges of 100 or 200 columns when there are large number of
>>> records(columnar
>>> > data) without a caching solution in the front.
>>> > b] What partitioner is recommended? so that the load in the cluster
>>> nodes is
>>> > not largely uneven.
>>> > c] Would you recommend changing the TimeUUID/Columnar sort mechanism
>>> (with a
>>> > change in the data model) to sort using row key mechanism? If so then
>>> what
>>> > partitioner is recommended?  with load not being largely uneven.
>>> >
>>> > Thanks
>>> >
>>>
>>
>>
>

--0016e65aeb0adb92ad0492b092d3
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

i) Yes<br><br>ii) Well, so you don&#39;t actually want to use version 1 UUI=
Ds for keys here. Although<br>they mostly increase in byte order over time,=
 it&#39;s only for the first 8 bytes.=A0 Instead,<br>you can use something =
like:<br>
<br>&#39;timestamp-foo&#39;<br><br>Where &#39;foo&#39; might be a randomly =
generated string or something unique per client.<br><br>You could also use =
&#39;YYYYMMDDSSmmm&#39; instead of the timestamp if that makes<br>queries e=
asier for you.<br>
<br>- Tyler<br><br><div class=3D"gmail_quote">On Fri, Oct 15, 2010 at 6:22 =
PM, Wicked J <span dir=3D"ltr">&lt;<a href=3D"mailto:wickedj2010@gmail.com"=
>wickedj2010@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_=
quote" style=3D"margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, =
204, 204); padding-left: 1ex;">
Tyler,<br>Thanks for answering my question. Can you please clarify on point=
 (c)?<br><br>i] Are you saying that if I move to second row (identified by =
a rowKey in Cassandra) after I hit 10 million=A0 col. values for 1st row, o=
nly then the second row will be written to a new node in the cluster?=A0 me=
aning all the 10 million column values within the first row (rowKey) until =
then have been written to one and the same node regardless of the # of node=
s in the cluster. <br>

<br>ii] Assume I change my data model to the one in below (CF1) with a &quo=
t;OrderPreservingPartitioner&quot; then would I be able to read data in the=
 order inserted? Because my understanding is TimeUUID values cannot be inse=
rted for row Keys based on the Thrift API in v0.6.4 i.e. from the insert me=
thod in Cassandra.Client or am I missing something?<br>

<br>CF1:<br><br>Key: &#39;1&#39;<br>=A0 name: colname, value: &#39;First In=
serted&#39;, timestamp: 1287165326492<br>Key: &#39;2&#39;<br>=A0 name: coln=
ame, value: &#39;Second Inserted&#39;, timestamp: 1287165326523<br><br>Than=
ks!<div>
<div></div><div class=3D"h5"><br>
<br><div class=3D"gmail_quote">On Fri, Oct 15, 2010 at 12:18 PM, Tyler Hobb=
s <span dir=3D"ltr">&lt;<a href=3D"mailto:tyler@riptano.com" target=3D"_bla=
nk">tyler@riptano.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_q=
uote" style=3D"margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 2=
04, 204); padding-left: 1ex;">


a) 10 mil sounds fine.=A0 Just watch out for compaction. Huge rows can kill=
 you there,<br>from my understanding.<br><br>b) Use RandomPartitioner unles=
s you absolutely have to use something else.<br><br>c) If you&#39;re insert=
ing all along one row and only moving to another row when you<br>


hit 10 mil, you&#39;re only going to be writing to one node at a time.=A0 I=
n this sense,<br>you might want to consider using the TimeUUID as a row key=
 instead.=A0 There&#39;s<br>not really a problem with having tons of rows i=
n a column family.<br>


<br>If you want to be able to get a slice of time with this scheme, you can=
 either use<br>an order preserving partitioner or have a second column fami=
ly with an index<br>row (or rows) sorted by TimeUUID. (This sounds like wha=
t you&#39;re suggesting.)<br>


<font color=3D"#888888">
<br>- Tyler</font><div><div></div><div><br><div class=3D"gmail_quote"><br><=
blockquote class=3D"gmail_quote" style=3D"margin: 0pt 0pt 0pt 0.8ex; border=
-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">I wrote some thoug=
hts about this on my blog. I think it&#39;s still mostly correct:<br>


<br>
=A0* <a href=3D"http://www.ayogo.com/techblog/2010/04/sorting-in-cassandra/=
" target=3D"_blank">http://www.ayogo.com/techblog/2010/04/sorting-in-cassan=
dra/</a><br>
<div><div></div><div><br>
On Fri, Oct 15, 2010 at 11:14 AM, Wicked J &lt;<a href=3D"mailto:wickedj201=
0@gmail.com" target=3D"_blank">wickedj2010@gmail.com</a>&gt; wrote:<br>
&gt; Hi,<br>
&gt; I&#39;m using TimeUUID/Sort by column name mechanism. The column value=
 can<br>
&gt; contain text data (in future they may contain image data as well) lead=
ing to<br>
&gt; the possibility of a row out-growing the RAM capacity. Given this back=
ground<br>
&gt; my questions are:<br>
&gt;<br>
&gt; a] How many columns are recommended against one row? Based on my app. =
needs,<br>
&gt; I can imagine having 10 million would be a good starting point for the=
<br>
&gt; max_limit (based on text data). Also note that my app. will use search=
 in<br>
&gt; ranges of 100 or 200 columns when there are large number of records(co=
lumnar<br>
&gt; data) without a caching solution in the front.<br>
&gt; b] What partitioner is recommended? so that the load in the cluster no=
des is<br>
&gt; not largely uneven.<br>
&gt; c] Would you recommend changing the TimeUUID/Columnar sort mechanism (=
with a<br>
&gt; change in the data model) to sort using row key mechanism? If so then =
what<br>
&gt; partitioner is recommended?=A0 with load not being largely uneven.<br>
&gt;<br>
&gt; Thanks<br>
&gt;<br>
</div></div></blockquote></div><br>
</div></div></blockquote></div><br>
</div></div></blockquote></div><br>

--0016e65aeb0adb92ad0492b092d3--