Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of rklaehn@gmail.com designates
 209.85.214.180 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAKkz8Q3S2DzM79_y9SsaFQeS09bn3zJTDiZVjV_K2nwz-=nLAg@mail.gmail.com>
References: 
 <CAGA++nkTyQXKCsBe=Dzjc8Cy4TpWwJc0ARY-0nLC0iqXdR0jfQ@mail.gmail.com>
	<CAKkz8Q2UwYy6AbRjFwoavFLcr5jEHELR-QYcmuXVJ7rG6nxwwQ@mail.gmail.com>
	<CAGA++nmaJJ4QT8ygMxQgALmqiEb_meW_wHKEcELM602Kyjvt6A@mail.gmail.com>
	<CAKkz8Q1UM92Ga0+eNaxV5KzXcS+hcSA87cCGc12LMP=CWCppog@mail.gmail.com>
	<CAGA++nkOcLiOsQg9RBJwVVMuwxULosmPAKKsjPjmtN2kzC0Dqg@mail.gmail.com>
	<CAKkz8Q05DrqVbXNH7Xbm0EtUrmbepvGS0GUyXN=3wZwJNwkawA@mail.gmail.com>
	<CAGA++nmFGhVJ51bR+gsuWBNKdir3BEurSg4p0ef5ixXK2W9pfg@mail.gmail.com>
	<CAKkz8Q3S2DzM79_y9SsaFQeS09bn3zJTDiZVjV_K2nwz-=nLAg@mail.gmail.com>
Date: Mon, 24 Feb 2014 17:51:40 +0100
Message-ID: 
 <CAGA++nknwF9EM+NXPmtEjS5xXRA-8Pf45uh0YJjPOT3_anCgjQ@mail.gmail.com>
Subject: Re: Performance problem with large wide row inserts using CQL
From: =?ISO-8859-1?Q?R=FCdiger_Klaehn?= <rklaehn@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7b66f0bd220cef04f329c923

--047d7b66f0bd220cef04f329c923
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

On Mon, Feb 24, 2014 at 11:47 AM, Sylvain Lebresne <sylvain@datastax.com>wr=
ote:

>
>>
>>>
>>>> I still have some questions regarding the mapping. Please bear with me
>>>> if these are stupid questions. I am quite new to Cassandra.
>>>>
>>>> The basic cassandra data model for a keyspace is something like this,
>>>> right?
>>>>
>>>> SortedMap<byte[], SortedMap<byte[], Pair<Long, byte[]>>
>>>>                  ^ row key. determines which server(s) the rest is
>>>> stored on
>>>>                                              ^ column key
>>>>                                                                ^
>>>> timestamp (latest one wins)
>>>>
>>>> ^ value (can be size 0)
>>>>
>>>
>>> It's a reasonable way to think of how things are stored internally, yes=
.
>>> Though as DuyHai mentioned, the first map is really sorting by token an=
d in
>>> general that means you use mostly the sorting of the second map concret=
ely.
>>>
>>>
>> Yes, understood.
>>
>> So the first SortedMap is sorted on some kind of hash of the actual key
>> to make sure the data gets evenly distributed along the nodes? What if m=
y
>> key is already a good hash: is there a way to use an identity function a=
s a
>> hash function (in CQL)?
>>
>
> It's possible, yes. The hash function we're talking about is what
> Cassandra calls "the partitioner". You configure the partitioner in the
> yaml config file and there is one partitioner, ByteOrderedPartitioner, th=
at
> is basically the identify function.
> We however usually discourage user for using it because the partitioner i=
s
> global to a cluster and cannot be changed (you basically pick it at clust=
er
> creation time and are stuck with it until the end of time), and since
> ByteOrderedPartitioner can easily lead to hotspot in the data distributio=
n
> if you're not careful...For those reasons, the default partitioner is als=
o
> much more tested, and I can't remember anyone mentioning the partitioner
> has been a bottleneck.
>
> Thanks for the info. I thought that this might be possible to adjust on a
per-keyspace level.

But if you can only do this globally, then I will leave it alone. Other
than the (probably negibile) performance impact of hashing the hash again,
there is nothing wrong with doing so. Hashing a SHA1-hash will give a good
distribution.

anyway, this is getting a bit off-topic.

cheers,

R=FCdiger

--047d7b66f0bd220cef04f329c923
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">On Mon, Feb 24, 2014 at 11:47 AM, Sylvain Lebresne <span d=
ir=3D"ltr">&lt;<a href=3D"mailto:sylvain@datastax.com" target=3D"_blank">sy=
lvain@datastax.com</a>&gt;</span> wrote:<br><div class=3D"gmail_extra"><div=
 class=3D"gmail_quote">
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra">=
<div class=3D"gmail_quote"><div class=3D""><blockquote class=3D"gmail_quote=
" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color=
:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

<div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote"><div=
><div>=A0</div>
<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"=
gmail_quote">

<div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bo=
rder-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:so=
lid;padding-left:1ex">
<div dir=3D"ltr"><div><br><div>I still have some questions regarding the ma=
pping. Please bear with
 me if these are stupid questions. I am quite new to Cassandra.<br><br>The =
basic cassandra data model for a keyspace is something like this, right?<br=
>
<br>SortedMap&lt;byte[], SortedMap&lt;byte[], Pair&lt;Long, byte[]&gt;&gt;<=
br></div><div>=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 ^ row key. d=
etermines which server(s) the rest is stored on<br></div><div>=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 ^ column key<br>


</div><div>=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 ^ timestamp (latest one wi=
ns)<br></div><div>=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0 ^ value (can be size 0)<br></div>


</div></div></blockquote><div><br></div></div><div>It&#39;s a reasonable wa=
y to think of how things are stored internally, yes. Though as DuyHai menti=
oned, the first map is really sorting by token and in general that means yo=
u use mostly the sorting of the second map concretely.</div>


<div>
<div>=A0</div></div></div></div></div></blockquote></div><div>Yes, understo=
od. <br><br>So the first SortedMap is sorted on some kind of hash of the ac=
tual key to make sure the data gets evenly distributed along the nodes? Wha=
t if my key is already a good hash: is there a way to use an identity funct=
ion as a hash function (in CQL)?<br>

</div></div></div></div></blockquote><div><br></div></div><div>It&#39;s pos=
sible, yes. The hash function we&#39;re talking about is what Cassandra cal=
ls &quot;the partitioner&quot;. You configure the partitioner in the yaml c=
onfig file and there is one partitioner,=A0ByteOrderedPartitioner, that is =
basically the identify function.</div>

<div>We however usually discourage user for using it because the partitione=
r is global to a cluster and cannot be changed (you basically pick it at cl=
uster creation time and are stuck with it until the end of time), and since=
 ByteOrderedPartitioner can easily lead to hotspot in the data distribution=
 if you&#39;re not careful...For those reasons, the default partitioner is =
also much more tested, and I can&#39;t remember anyone mentioning the parti=
tioner has been a bottleneck.</div>

<div><br></div></div></div></div></blockquote><div>Thanks for the info. I t=
hought that this might be possible to adjust on a per-keyspace level. <br><=
br>But if you can only do this globally, then I will leave it alone. Other =
than the (probably negibile) performance impact of hashing the hash again, =
there is nothing wrong with doing so. Hashing a SHA1-hash will give a good =
distribution.<br>
</div></div><br></div><div class=3D"gmail_extra">anyway, this is getting a =
bit off-topic.<br><br></div><div class=3D"gmail_extra">cheers,<br><br>R=FCd=
iger<br></div></div>

--047d7b66f0bd220cef04f329c923--