Subject: Re: Help for creating a custom partitioner
From: Clement Honore <honore.c@gmail.com>
Date: Mon, 1 Oct 2012 10:45:58 +0200
To: user@cassandra.apache.org
In-Reply-To: <1348849740.5202.4.camel@tim-desktop>

Hi,

thanks for your answer.

We plan to use manual indexing too (with native C* indexing for other cases).

So, for one index, we will get plenty of foreign keys, and a MultiGet call to fetch all the associated entities would, with RandomPartitioner, spread across the whole cluster.
As we don't know the cluster size yet, and as it's expected to grow at an unknown rate, we are thinking about alternatives now, for scalability.
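For context, here is a rough sketch of why RandomPartitioner fans such a MultiGet out. It derives the ring token from an MD5 hash of the *whole* row key, so keys sharing a category prefix still land anywhere on the ring (illustrative Python model, not Cassandra's actual Java token code; the key format is a made-up example):

```python
import hashlib

def random_partitioner_token(row_key: bytes) -> int:
    """Rough model of RandomPartitioner: token = MD5 of the whole row key."""
    return int.from_bytes(hashlib.md5(row_key).digest(), "big")

# Three docs of the same category get unrelated tokens, hence a MultiGet
# for one category fans out across the whole cluster.
tokens = [random_partitioner_token(b"reports:doc-%d" % i) for i in range(1, 4)]
print(len(set(tokens)))  # 3 distinct tokens
```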

But, to tell the truth, we have not done performance tests so far.
Still, as the choice of a partitioner is the first C* cornerstone, we are already thinking about a new partitioner.
We are planning "random vs custom partitioner" tests => hence my questions about creating another one first.

AFAIS, your partitioner (the higher bits of the hash from hashing the category, and the lower bits of the hash from hashing the document id) will put all the docs of a category on (on average) one node. Quite interesting, thanks!
I could add such a partitioner to my test suite.
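If I understand your suggestion correctly, it could be sketched like this (illustrative Python; the 64/64 bit split and key names are my assumptions, and a real partitioner would be a Java IPartitioner implementation):

```python
import hashlib

def combined_token(category: bytes, doc_id: bytes) -> int:
    """128-bit token: high 64 bits from the category hash, low 64 bits
    from the document-id hash, so one category's rows sit close together
    on the ring without sharing the exact same token."""
    hi = int.from_bytes(hashlib.md5(category).digest()[:8], "big")
    lo = int.from_bytes(hashlib.md5(doc_id).digest()[:8], "big")
    return (hi << 64) | lo

t1 = combined_token(b"reports", b"doc-1")
t2 = combined_token(b"reports", b"doc-2")
assert t1 >> 64 == t2 >> 64  # same category => same high bits (neighbours on the ring)
assert t1 != t2              # different docs => still distinct tokens
```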

But why not just hash the "category" part of the row key?
With such a partitioner, as said before, many rows on *one* node are going to have the same hash value.
- if it hurts Cassandra behavior/performance => I am curious to know why. Anyway, in that case, I see your partitioner, so far, as the best answer to my wishes!
- if it does NOT hurt Cassandra behavior/performance => it sounds, then, like an optimal partitioner for our needs.
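The category-only scheme I have in mind would look roughly like this (illustrative Python; the key names are made up): the doc id plays no role, so every row of a category collapses onto a single token, which is exactly the collision situation I am asking about:

```python
import hashlib

def token_for(category: bytes, doc_id: bytes) -> int:
    """Hypothetical category-only partitioner: doc_id is deliberately
    ignored, so all rows of one category map to one token."""
    return int.from_bytes(hashlib.md5(category).digest(), "big")

# Two different docs of the same category collide on a single token,
# while another category lands elsewhere on the ring:
assert token_for(b"reports", b"doc-1") == token_for(b"reports", b"doc-2")
assert token_for(b"reports", b"doc-1") != token_for(b"invoices", b"doc-1")
```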

Any idea about Cassandra behavior with such a hash (category-only) partitioner?

Regards,
Clément

2012/9/28 Tim Wintle <timwintle@gmail.com>
On Fri, 2012-09-28 at 18:20 +0200, Clement Honore wrote:
> Hi,
>
> I have hierarchical data.
>
> I'm storing them in CF with rowkey somewhat like (category, doc id), and
> plenty of columns for a doc definition.
>
> I have hierarchical data traversal too.
>
> The user just chooses one category, and then, interact with docs belonging
> only to this category.
>
> 1) If I use RandomPartitioner, all docs could be spread within all nodes in
> the cluster => bad performance.
>
> 2) Using RandomPartitioner, an alternative design could be rowkey=category
> and column name=(doc id, prop name)
>
> I don't want it because I need fixed column names for indexing purposes,
> and the "category" is quite a lonnnng string.
>
> 3) Then, I want to define a new partitioner for my rowkey (category, doc
> id), doing MD5 only for the "category" part.
>
> The question is: with such partitioner, many rows on *one* node are going
> to have the same MD5 value, as a result of this new partitioner.

If you do decide that having rows on the same node is what you want,
then you could take the higher bits of the hash from hashing the
category, and the lower bits of the hash from hashing the document id.

That would mean documents in a category would be close to each other in
the ring - while being unlikely to share the same hash.


However, if you're doing this then all reads/writes to the category are
going to be to a single machine. That's not going to spread the load across the cluster very well as I assume a few categories are going to
be far more popular than others.

Have you tested that you actually get bad performance from
RandomPartitioner?

Tim

