Subject: Re: new "nodetool ring" output and unbalanced ring?
From: Tyler Hobbs <tyler@datastax.com>
To: user@cassandra.apache.org
Date: Mon, 10 Sep 2012 13:48:45 -0500

It leaves some breathing room for fixing mistakes, adding DCs, etc. The set of data in a 100 token range is basically the same as a 1 token range: nothing, statistically speaking.
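A quick back-of-the-envelope check of that claim (a hypothetical Python sketch, assuming RandomPartitioner's 2**127 token space, which matches the tokens later in this thread):

    # Fraction of the ring covered by a 100-token slice under
    # RandomPartitioner (token space of 2**127).
    RING = 2**127
    print(100 / RING)  # ~5.9e-37: statistically nothing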

On Mon, Sep 10, 2012 at 2:21 AM, Guy Incognito <dnd1066@gmail.com> wrote:
out of interest, why -100 and not -1 or +1? any particular reason?


On 06/09/2012 19:17, Tyler Hobbs wrote:
To minimize the impact on the cluster, I would bootstrap a new 1d node at (42535295865117307932921825928971026432 - 100), then decommission the 1c node at 42535295865117307932921825928971026432 and run cleanup on your us-east nodes.
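For concreteness, the token arithmetic behind that suggestion as a hypothetical Python sketch (the nodetool subcommands named in the comments, decommission and cleanup, are real; the placeholder hosts are not):

    # initial_token for the replacement 1d node: just below the
    # existing 1c node's token, per the suggestion above.
    old_token = 42535295865117307932921825928971026432
    new_token = old_token - 100
    print(new_token)  # 42535295865117307932921825928971026332
    # Once the new node has bootstrapped at new_token:
    #   nodetool -h <old 1c node> decommission
    #   nodetool -h <each us-east node> cleanup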

On Thu, Sep 6, 2012 at 1:11 PM, William Oberman <oberman@civicscience.com> wrote:
Didn't notice the racks! Of course....

If I change a 1c to a 1d, what would I have to do to make sure data shuffles around correctly? Repair everywhere?

will

On Thu, Sep 6, 2012 at 2:09 PM, Tyler Hobbs <tyler@datastax.com> wrote:
The main issue is that one of your us-east nodes is in rack 1d, while the rest are in rack 1c. With NTS and multiple racks, Cassandra will try to use one node from each rack as a replica for a range until it either meets the RF for the DC or runs out of racks, in which case it just picks nodes sequentially going clockwise around the ring (starting from the range being considered, not the last node that was chosen as a replica).

To fix this, you'll either need to make the 1d node a 1c node, or make 42535295865117307932921825928971026432 a 1d node so that you're alternating racks within that DC.
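To see the effect of that walk, here is a simplified, hypothetical Python sketch of the per-DC placement rule described above (not Cassandra's actual code); run against the ring in Will's nodetool output below, it reproduces the reported effective-ownership figures:

    # Simplified sketch of NTS per-DC replica selection (illustrative,
    # not Cassandra's real implementation): walk clockwise from a
    # range's primary position, take one node per unseen rack until the
    # DC's RF is met, then fill from the skipped nodes in ring order.
    RING = 2**127  # RandomPartitioner token space

    # (token, dc, rack), taken from the nodetool ring output below
    nodes = sorted([
        (0,                                        "us-east",   "1c"),
        (1,                                        "analytics", "1c"),
        (42535295865117307932921825928971026432,   "us-east",   "1c"),
        (85070591730234615865843651857942052864,   "us-east",   "1c"),
        (85070591730234615865843651857942052865,   "analytics", "1d"),
        (127605887595351923798765477786913079296,  "us-east",   "1d"),
    ])
    rf = {"analytics": 1, "us-east": 3}

    def replicas(dc, start):
        order = nodes[start:] + nodes[:start]  # clockwise from primary
        dc_ring = [n for n in order if n[1] == dc]
        chosen, skipped, racks = [], [], set()
        for n in dc_ring:
            if len(chosen) == rf[dc]:
                break
            if n[2] not in racks:
                chosen.append(n)
                racks.add(n[2])
            else:
                skipped.append(n)
        return chosen + skipped[:rf[dc] - len(chosen)]

    owned = dict.fromkeys((n[0] for n in nodes), 0)
    for i, (tok, _, _) in enumerate(nodes):
        size = (tok - nodes[i - 1][0]) % RING  # size of range (prev, tok]
        for dc in rf:
            for n in replicas(dc, i):
                owned[n[0]] += size

    for tok, dc, rack in nodes:
        print(f"{dc:9} {rack}  {100 * owned[tok] / RING:6.2f}%  {tok}")
    # prints 75/75/50% for the us-east 1c nodes, 50/50% for analytics,
    # and 100% for the lone us-east 1d node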


On Thu, Sep 6, 2012 at 12:54 PM, William Oberman <oberman@civicscience.com> wrote:
Hi,

I recently upgraded from 0.8.x to 1.1.x (through 1.0 briefly), and nodetool ring seems to have changed from "owns" to "effectively owns". "Effectively owns" seems to account for replication factor (RF). I'm ok with all of this, yet I still can't figure out what's up with my cluster. I have a NetworkTopologyStrategy with two data centers (DCs), with the following RF and node count per DC:
DC Name, RF, # in DC
analytics, 1, 2
us-east, 3, 4
So I'd expect 50% on each analytics node, and 75% for each us-east node. Instead, I have two nodes in us-east with 50/100??? (the other two are 75/75 as expected).
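For reference, the arithmetic behind that expectation (a trivial sketch): with evenly spaced tokens, a node's effective ownership is its DC's RF divided by the DC's node count.

    # Expected effective ownership = RF / nodes_in_dc, per DC
    for dc, rf, n in [("analytics", 1, 2), ("us-east", 3, 4)]:
        print(f"{dc}: {100 * rf / n:.0f}%")  # analytics: 50%, us-east: 75%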

Here is the output of nodetool (all nodes report the same thing):
Address  DC         Rack  Status  State   Load       Effective-Ownership  Token
                                                                          127605887595351923798765477786913079296
x.x.x.x  us-east    1c    Up      Normal  94.57 GB   75.00%               0
x.x.x.x  analytics  1c    Up      Normal  60.64 GB   50.00%               1
x.x.x.x  us-east    1c    Up      Normal  131.76 GB  75.00%               42535295865117307932921825928971026432
x.x.x.x  us-east    1c    Up      Normal  43.45 GB   50.00%               85070591730234615865843651857942052864
x.x.x.x  analytics  1d    Up      Normal  60.88 GB   50.00%               85070591730234615865843651857942052865
x.x.x.x  us-east    1d    Up      Normal  98.56 GB   100.00%              127605887595351923798765477786913079296

If I use cassandra-cli to do "show keyspaces;" I get (and again, all nodes report the same thing):
Keyspace: civicscience:
  Replication Strategy: org.apache.cassandra.locator.NetworkTopologyStrategy
  Durable Writes: true
    Options: [analytics:1, us-east:3]
I removed the output about all of my column families (CFs), hopefully that doesn't matter.

Did I compute the tokens wrong? Is there a combination of nodetool commands I can run to migrate the data around to rebalance to 75/75/75/75? I routinely run repair already. And as the release notes required, I ran upgradesstables during the upgrade process.
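As it turns out, the tokens in the ring output above do match the usual even-spacing formula, computed per DC with the second DC offset by 1 (a hypothetical sketch):

    # Evenly spaced RandomPartitioner tokens, computed per DC; the
    # second DC's tokens are offset by 1 to avoid exact collisions.
    RING = 2**127
    def tokens(n, offset=0):
        return [i * RING // n + offset for i in range(n)]
    print(tokens(4))     # us-east:   0, 42535..., 85070..., 127605...
    print(tokens(2, 1))  # analytics: 1, 85070...52865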

Before the upgrade, I was getting analytics = 0%, and us-east = 25% on each node, which I expected for "owns".

will




--
Tyler Hobbs
DataStax




--
Will Oberman
Civic Science, Inc.
3030 Penn Avenue., First Floor
Pittsburgh, PA 15201
(M) 412-480-7835
(E) oberman@civicscience.com



--
Tyler Hobbs
DataStax





--
Tyler Hobbs
DataStax