Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: softfail (athena.apache.org: transitioning domain of
 caleb@steelhouse.com does not designate 66.46.182.56 as permitted sender)
From: Caleb Rackliffe <caleb@steelhouse.com>
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Date: Sun, 18 Mar 2012 02:47:05 -0400
Subject: Token Ring Gaps in a 2 DC Setup
Thread-Topic: Token Ring Gaps in a 2 DC Setup
Thread-Index: Ac0E0u58F7ib8TG8RwyB0QT0vSbpuA==
Message-ID: <CB8AD2F7.928D%caleb@steelhouse.com>
Accept-Language: en-US
Content-Language: en-US
user-agent: Microsoft-MacOutlook/14.14.0.111121
acceptlanguage: en-US
Content-Type: multipart/alternative;
	boundary="_000_CB8AD2F7928Dcalebsteelhousecom_"
MIME-Version: 1.0

--_000_CB8AD2F7928Dcalebsteelhousecom_
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

Hi Everyone,

I have a cluster using NetworkTopologyStrategy that looks like this:

Address         DC    Rack    Status  State   Load       Owns     Token
10.41.116.22    DC1   RAC1    Up      Normal  13.21 GB   10.00%   0
10.54.149.202   DC2   RAC1    Up      Normal  6.98 GB    0.00%    1
10.41.116.20    DC1   RAC2    Up      Normal  12.75 GB   10.00%   17014118300000000000000000000000000000
10.41.116.16    DC1   RAC3    Up      Normal  12.62 GB   10.00%   34028236700000000000000000000000000000
10.54.149.203   DC2   RAC1    Up      Normal  6.7 GB     0.00%    34028236700000000000000000000000000001
10.41.116.18    DC1   RAC4    Up      Normal  10.8 GB    10.00%   51042355000000000000000000000000000000
10.41.116.14    DC1   RAC5    Up      Normal  10.27 GB   10.00%   68056473400000000000000000000000000000
10.54.149.204   DC2   RAC1    Up      Normal  6.7 GB     0.00%    68056473400000000000000000000000000001
10.41.116.12    DC1   RAC6    Up      Normal  10.58 GB   10.00%   85070591700000000000000000000000000000
10.41.116.10    DC1   RAC7    Up      Normal  10.89 GB   10.00%   102084710000000000000000000000000000000
10.54.149.205   DC2   RAC1    Up      Normal  7.51 GB    0.00%    102084710000000000000000000000000000001
10.41.116.8     DC1   RAC8    Up      Normal  10.48 GB   10.00%   119098828000000000000000000000000000000
10.41.116.24    DC1   RAC9    Up      Normal  10.89 GB   10.00%   136112947000000000000000000000000000000
10.54.149.206   DC2   RAC1    Up      Normal  6.37 GB    0.00%    136112947000000000000000000000000000001
10.41.116.26    DC1   RAC10   Up      Normal  11.17 GB   10.00%   153127065000000000000000000000000000000

There are two data centers: DC1 with 10 nodes and 2 replicas, and DC2 with
5 nodes and 1 replica.  I assigned tokens so that each node in the smaller
DC handles 20% of the keyspace, which should mean roughly equal usage on all
15 boxes.  That isn't what I'm seeing, though.  The "1 replica" nodes are
carrying only a bit more than half the data of the "2 replica" nodes (about
6.9 GB versus 11.4 GB per node on average).  It's almost as if those nodes
are handling only 10% of the keyspace instead of 20%.
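
The expectation above can be checked with a short Python sketch. It rests on
several assumptions that are not stated in the listing: that the cluster uses
RandomPartitioner (token space of size 2**127), that NetworkTopologyStrategy
places replicas independently within each data center, and that rack-aware
placement can be ignored (every DC1 node is in a different rack, and DC2 keeps
a single replica). The names RING, REPLICAS, and expected_share are
illustrative only and are not part of Cassandra.

```python
# Assumption: RandomPartitioner, whose token space has 2**127 positions.
RING_SIZE = 2 ** 127
UNIT = 10 ** 29  # the listed tokens are short mantissas followed by zeros

# Tokens from the ring listing, grouped by data center.
RING = {
    "DC1": [m * UNIT for m in (0, 170141183, 340282367, 510423550, 680564734,
                               850705917, 1020847100, 1190988280, 1361129470,
                               1531270650)],
    # Each DC2 token is a DC1 token plus one.
    "DC2": [m * UNIT + 1 for m in (0, 340282367, 680564734, 1020847100,
                                   1361129470)],
}
REPLICAS = {"DC1": 2, "DC2": 1}  # replicas per data center, as described


def expected_share(ring, replicas):
    """Estimate the fraction of the whole data set each node should hold.

    Simplified model: within a data center, a node stores its own primary
    range plus the primary ranges of the (rf - 1) nodes preceding it on that
    data center's ring. Rack-aware placement is ignored.
    """
    shares = {}
    for dc, tokens in ring.items():
        tokens = sorted(tokens)
        n = len(tokens)
        rf = min(replicas[dc], n)
        # Width of the range (previous token, token], wrapping around zero.
        widths = [((tokens[i] - tokens[i - 1]) % RING_SIZE) or RING_SIZE
                  for i in range(n)]
        for i, token in enumerate(tokens):
            held = sum(widths[(i - k) % n] for k in range(rf))
            shares[(dc, token)] = held / RING_SIZE
    return shares


if __name__ == "__main__":
    for (dc, token), share in sorted(expected_share(RING, REPLICAS).items()):
        print(f"{dc}  {token:<40d} {share:.4f}")
```

Under these assumptions every node in both data centers comes out at roughly
0.2 of the data set, so the token layout by itself would not account for the
difference in load.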

Does anybody have any suggestions as to what might be going on?  I've run
nodetool getendpoints against a bunch of keys, and I always get back three
nodes, so I'm pretty confused.  I've also run repair on a few nodes in both
data centers, but the sizes are still vastly different.

Thanks!

Caleb Rackliffe | Software Developer
M 949.981.0159 | caleb@steelhouse.com

--_000_CB8AD2F7928Dcalebsteelhousecom_--