Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: 
 <CALdd-zhunMZk+DUN=VGm8wC-tD659W7HUG9XcGUYoDtGiDKmsw@mail.gmail.com>
References: 
 <CAKc7CJRdGhHPr81qUSE-BEb-YmRatC++5o0J23WHbWC7DFwzbw@mail.gmail.com>
	<CALdd-zj5VBg=HdnwgsGjG5sqBftEL8-y-1iLf=PoOR_+bPwaSg@mail.gmail.com>
	<CAKc7CJTAF=kg4iN-Uf6NCUm0k2eWQj5usO4YGs4SQ62n-P79WQ@mail.gmail.com>
	<CALdd-zhunMZk+DUN=VGm8wC-tD659W7HUG9XcGUYoDtGiDKmsw@mail.gmail.com>
Date: Wed, 17 Aug 2011 07:01:33 -0500
Message-ID: 
 <CAKc7CJSqX+Af8Hny3mEpKrqBxQU5DezHP45vzm2EYa6nTA7_MA@mail.gmail.com>
Subject: Re: Partitioning, tokens, and sequential keys
From: David McNelis <dmcnelis@agentisenergy.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=bcaec548a4fbe9a71e04aab24180

--bcaec548a4fbe9a71e04aab24180
Content-Type: text/plain; charset=ISO-8859-1

Well, I think what happened was that the script generated three tokens (0,
567x, and 1134x), but based on how we read the comments in the yaml file,
we set initial_token only on the second and third nodes and left it blank
for the seed node. We then started the seed node, followed by the other
two, and the seed node took on the 613x token.
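
For reference, the three generated values are consistent with evenly spaced
RandomPartitioner tokens, i * 2^127 / N. A minimal sketch of that arithmetic
follows. This is not the wiki script itself: the name balanced_tokens is made
up for illustration, and it assumes the partitioner's token space runs from 0
to 2^127.

```python
# Assumption: RandomPartitioner tokens live in the range 0 .. 2**127.
RING_SIZE = 2 ** 127


def balanced_tokens(node_count):
    """Return one evenly spaced initial_token per node (hypothetical helper)."""
    return [i * RING_SIZE // node_count for i in range(node_count)]


if __name__ == "__main__":
    # One token per line, in ring order, for a three-node cluster.
    for token in balanced_tokens(3):
        print(token)
```

For three nodes this yields 0 plus the 567x and 1134x values quoted below, so
the node that ended up at 613x is the one that should have been at 0.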

One question on nodetool ring: the "Owns" column refers to the share of the
possible keys (the token range) each node owns, not the actual amount of data
on the node, correct? So you could technically have loads of 15 GB, 60 GB,
and 15 GB on a three-node cluster, but if the tokens are set correctly each
node would still own 33.33%.
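
To make the ownership arithmetic concrete, here is a rough sketch that
computes each node's share of the token ring from the three tokens we ended
up with. It assumes RandomPartitioner's 0 to 2^127 token space, and that a
node owns the range from the previous token (exclusive) up to its own token.
The ownership helper is hypothetical and is not part of nodetool.

```python
# Assumption: RandomPartitioner tokens live in the range 0 .. 2**127.
RING_SIZE = 2 ** 127


def ownership(tokens):
    """Map each token to the fraction of the ring its node owns."""
    ring = sorted(tokens)
    shares = {}
    for i, token in enumerate(ring):
        previous = ring[i - 1]  # index -1 wraps around to the last token
        # A zero-width span only happens with a single node, which owns it all.
        span = (token - previous) % RING_SIZE or RING_SIZE
        shares[token] = span / RING_SIZE
    return shares


if __name__ == "__main__":
    current = [
        56713727820156410577229101238628035242,
        61396109050359754194262152792166260437,
        113427455640312821154458202477256070485,
    ]
    for token, share in ownership(current).items():
        print(f"{token}: {share:.2%}")
```

With our current tokens this works out to roughly 66.7%, 2.8%, and 30.6%,
which lines up with one node taking on over 60% of the data.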

Thanks.

On Tue, Aug 16, 2011 at 3:33 PM, Jonathan Ellis <jbellis@gmail.com> wrote:

> Yes, that looks about right.
>
> Totally baffled how the wiki script could spit out those tokens for a
> 3-node cluster.
>
> On Tue, Aug 16, 2011 at 2:04 PM, David McNelis
> <dmcnelis@agentisenergy.com> wrote:
> > Currently we have the initial_token for the seed node blank, and the
> > three tokens we ended up with are:
> > 56713727820156410577229101238628035242
> > 61396109050359754194262152792166260437
> > 113427455640312821154458202477256070485
> > I would assume that we'd want to take the node that is
> > 61396109050359754194262152792166260437 and move it to 0, yes?
> > In theory that should largely balance out our data... or am I missing
> > something there?
> > On Tue, Aug 16, 2011 at 1:54 PM, Jonathan Ellis <jbellis@gmail.com> wrote:
> >>
> >> What tokens did you end up using?
> >>
> >> Are you sure it's actually due to different amounts of rows? Have you
> >> run cleanup and compact to make sure it's not unused data / obsolete
> >> replicas taking up the space?
> >>
> >> On Tue, Aug 16, 2011 at 1:41 PM, David McNelis
> >> <dmcnelis@agentisenergy.com> wrote:
> >> > We are currently running a three-node cluster where we assigned the
> >> > initial tokens using the Python script from the wiki. We're using the
> >> > RandomPartitioner, RF=1, on Cassandra 0.8 from the Riptano RPM.
> >> > However, we're seeing one node take on over 60% of the data as we
> >> > load it.
> >> > Our keys are sequential and can range from 0 to 2^64, though in
> >> > practice we're between 1 and 2,000,000,000, with the current max
> >> > around 50,000. To balance out the load, would we be best served by
> >> > changing our tokens so that the top and bottom thirds of that node's
> >> > range go to the previous and next nodes respectively, then running
> >> > nodetool move?
> >> > Even if we do that, it seems likely that we'd keep running into this
> >> > sort of issue as we add additional data. Would we be better served
> >> > by a different partitioner? Or will we need to manage our tokens
> >> > very actively to avoid getting into an unbalanced situation?
> >> >
> >> > --
> >> > David McNelis
> >> > Lead Software Engineer
> >> > Agentis Energy
> >> > www.agentisenergy.com
> >> > o: 630.359.6395
> >> > c: 219.384.5143
> >> > A Smart Grid technology company focused on helping consumers of energy
> >> > control an often under-managed resource.
> >> >
> >> >
> >>
> >>
> >>
> >> --
> >> Jonathan Ellis
> >> Project Chair, Apache Cassandra
> >> co-founder of DataStax, the source for professional Cassandra support
> >> http://www.datastax.com


-- 
*David McNelis*
Lead Software Engineer
Agentis Energy
www.agentisenergy.com
o: 630.359.6395
c: 219.384.5143

*A Smart Grid technology company focused on helping consumers of energy
control an often under-managed resource.*

--bcaec548a4fbe9a71e04aab24180--