Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
MIME-Version: 1.0
In-Reply-To: 
 <CY1PR08MB1819F2B7D90169E53B9831F79B0C0@CY1PR08MB1819.namprd08.prod.outlook.com>
References: 
 <CY1PR08MB18198D24346F7659519817C99B0E0@CY1PR08MB1819.namprd08.prod.outlook.com>
	<CABNXB2BUH557XeOW0=pGPqqiyWvZueSVut9mP4+aUGVtGMpUdw@mail.gmail.com>
	<CY1PR08MB181951FF194BE2848DA800719B0E0@CY1PR08MB1819.namprd08.prod.outlook.com>
	<CABNXB2AFCwxbMSdGKukjyooxUzrMT8zGuHi=TRdOJN+ViyU78Q@mail.gmail.com>
	<CY1PR08MB1819A9BCB47941686194CC359B0D0@CY1PR08MB1819.namprd08.prod.outlook.com>
	<CAOxAL62oa+bmdo6bGgFh8MsBdxOyeXCRdU1pwrUFn2Pnc=EL+w@mail.gmail.com>
	<CY1PR08MB1819F2B7D90169E53B9831F79B0C0@CY1PR08MB1819.namprd08.prod.outlook.com>
Date: Fri, 4 Dec 2015 11:45:41 -0500
Message-ID: 
 <CAOxAL62bM6R0-_C0J++_666vu8VTbQyPy0nQyfZ4ZNgo+YC4GA@mail.gmail.com>
Subject: Re: cassandra reads are unbalanced
From: Jack Krupansky <jack.krupansky@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a11440f82e1cdf60526153ce7

--001a11440f82e1cdf60526153ce7
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Thanks for the elaboration. A few more questions...

Is there only a single thread in each client, or are there multiple threads
reading in parallel? In other words, does a read need to complete before the
next read is issued?

What client Cassandra driver are you using? Java?

What does your connection code look like, say compared to the example in
the doc:
http://docs.datastax.com/en/developer/java-driver/2.0/java-driver/quick_start/qsSimpleClientCreate_t.html

I just want to confirm that it really is connecting only to the local data
center, that it is using round robin, and whether it is token aware.
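
In case it helps frame the token-aware question, here is a toy model of what
DuyHai described earlier in the thread: if the first replica in the query plan
is always the primary, a heavily read partition sends all of its reads to one
node, whereas shuffling the replicas spreads them out. This is plain Java and
not the DataStax driver's code. The class name, the method name, and the
assumption of a single hot partition with node 0 as its primary are all mine,
made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Toy model only; this is NOT the DataStax driver's implementation.
// Assumption: every read targets one hot partition whose replicas, in ring
// order, are node 0 (the primary), node 1 and node 2.
final class ReplicaSelectionSketch {

    // Counts how many of `requests` reads are first sent to each node.
    static int[] simulate(int requests, boolean shuffleReplicas, long seed) {
        Random rng = new Random(seed);
        int[] firstChoice = new int[3];
        for (int i = 0; i < requests; i++) {
            // Query plan in ring order: the primary replica comes first.
            List<Integer> plan = new ArrayList<>(Arrays.asList(0, 1, 2));
            if (shuffleReplicas) {
                // Randomize the replica order for this request.
                Collections.shuffle(plan, rng);
            }
            firstChoice[plan.get(0)]++;
        }
        return firstChoice;
    }

    public static void main(String[] args) {
        System.out.println("no shuffle: " + Arrays.toString(simulate(3000, false, 42L)));
        System.out.println("shuffle:    " + Arrays.toString(simulate(3000, true, 42L)));
    }
}
```

Without shuffling, all 3000 reads in this model land on node 0; with shuffling
they spread roughly evenly across the three nodes. Whether your real traffic
has hot partitions like this is exactly what I can't tell from here.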


-- Jack Krupansky

On Fri, Dec 4, 2015 at 10:51 AM, Walsh, Stephen <Stephen.Walsh@aspect.com>
wrote:

> Thanks for your input, but I think I’ve already answered most of your
> questions.
>
> How many clients do you have performing reads?
>
> ------------------
> On Wed, Dec 2, 2015 at 6:44 PM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
> …
> There are 2 applications (1 for each DC) that read and write at the same
> rate to their local DC
> …
> ------------------
>
> Is your load balancer in front of your clients or between your clients and
> Cassandra?
>
> ------------------
> On Thu, Dec 3, 2015 at 4:58 AM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
> …
> our production applications are behind a round robin load balancer
> …
> ------------------
>
> No load balancers talk to Cassandra. I’m only mentioning this to show that
> the writes / reads are evenly distributed over the 2 DCs.
>
> Does Node1 of DC2 have the exact same hardware configuration as the other
> nodes?
>
> Yes
>
> Is it in the same rack?
>
> It’s in AWS, but we have configured the GossipingPropertyFileSnitch so that
> all nodes are on unique racks.
>
> Maybe your load balancer thinks that node is more capable and handles
> requests faster so that it looks less loaded than the other two nodes
>
> Unlikely, it’s all TCP SSL pass-through connections. It doesn’t balance on
> load, it just round robins each request.
>
> You might also check the read counts after a very short interval of time
> to see if Node1 is uniformly getting more requests or just occasionally
>
> ------------------
> On Wed, Dec 2, 2015 at 3:36 PM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
> …
> We monitor the number of reads / writes of every table via the Cassandra
> JMX metrics (cassandra.db.read_count).
> …
> ------------------
>
> We can only monitor in a 1-hour moving window.
>
> Maybe the other two nodes are in a different rack that occasionally has
> net connectivity issues
>
> Unlikely, since it’s AWS.
>
> *From:* Jack Krupansky [mailto:jack.krupansky@gmail.com]
> *Sent:* 03 December 2015 16:11
> *To:* user@cassandra.apache.org
> *Subject:* Re: cassandra reads are unbalanced
>
> How many clients do you have performing reads?
>
> Is your load balancer in front of your clients or between your clients and
> Cassandra?
>
> Does Node1 of DC2 have the exact same hardware configuration as the other
> nodes? Is it in the same rack? Maybe your load balancer thinks that node
> is more capable and handles requests faster so that it looks less loaded
> than the other two nodes.
>
> You might also check the read counts after a very short interval of time
> to see if Node1 is uniformly getting more requests or just occasionally.
> Maybe the other two nodes are in a different rack that occasionally has net
> connectivity issues so that the requests get diverted by the client/load
> balancer to Node1 during those times.
>
> -- Jack Krupansky
>
> On Thu, Dec 3, 2015 at 4:58 AM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
>
> Thanks, but keep in mind that both DCs should be getting the same load. Our
> production applications are behind a round-robin load balancer, so each of
> our local applications talks to its local Cassandra data center.
>
> It took about 4 hours, but the nodetool cleanup eventually balanced all
> nodes.
>
> *From:* DuyHai Doan [mailto:doanduyhai@gmail.com]
> *Sent:* 02 December 2015 16:27
> *To:* user@cassandra.apache.org
> *Subject:* Re: cassandra reads are unbalanced
>
> If you're using the Java driver with LOCAL_ONE and the default load
> balancing strategy (TokenAware wrapped on DCAwareRoundRobin), the
> driver will always select the primary replica. To change this behavior and
> introduce some randomness so that non-primary replicas get a chance to
> serve a read, use:
>
> new TokenAwarePolicy(new DCAwareRoundRobinPolicy("local_DC"), true)
>
> The second parameter (true) asks the TokenAware policy to "shuffle" the
> replicas on each request to avoid always returning the primary replica.
>
> On Wed, Dec 2, 2015 at 6:44 PM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
>
> Very good questions.
>
> We have reads and writes at LOCAL_ONE.
>
> There are 2 applications (1 for each DC) that read and write at the same
> rate to their local DC.
>
> (All reads / writes started out perfectly even and degraded over time.)
>
> We use the DCAwareRoundRobin policy.
>
> An update on the nodetool cleanup: it has helped but hasn’t balanced all
> nodes. Node 1 on DC2 is still quite high.
>
> Node 1 (DC1)  =  1.35k   (seeder)
> Node 2 (DC1)  =  1.54k
> Node 3 (DC1)  =  1.45k
>
> Node 1 (DC2)  =  2.06k   (seeder)
> Node 2 (DC2)  =  1.38k
> Node 3 (DC2)  =  1.43k
>
> *From:* DuyHai Doan [mailto:doanduyhai@gmail.com]
> *Sent:* 02 December 2015 14:22
> *To:* user@cassandra.apache.org
> *Subject:* Re: cassandra reads are unbalanced
>
> Which consistency level do you use for reads? ONE? Are you reading from
> only DC1 or from both DCs?
>
> What is the LoadBalancingStrategy you have configured for your driver?
> TokenAware wrapped on DCAwareRoundRobin?
>
> On Wed, Dec 2, 2015 at 3:36 PM, Walsh, Stephen <Stephen.Walsh@aspect.com>
> wrote:
>
> Hey all,
>
> Thanks for taking the time to help.
>
> So we have 6 Cassandra nodes in 2 data centers.
>
> Both data centers have a replication factor of 3, so all nodes have all
> the data.
>
> Over the last 2 days we’ve noticed that data reads / writes have shifted
> from balanced to unbalanced.
>
> (Nodetool status still shows 100% ownership on every node, with similar
> sizes.)
>
> For example:
>
> We monitor the number of reads / writes of every table via the Cassandra
> JMX metrics (cassandra.db.read_count).
>
> Over the last hour of this run:
>
> Reads
>
> Node 1 (DC1)  =  1.79k   (seeder)
> Node 2 (DC1)  =  1.92k
> Node 3 (DC1)  =  1.97k
>
> Node 1 (DC2)  =  2.90k   (seeder)
> Node 2 (DC2)  =  1.76k
> Node 3 (DC2)  =  1.19k
>
> As you see on DC1, everything is pretty well balanced, but on DC2 the
> reads favour Node1 over Node 3.
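
To put rough numbers on that imbalance, here is a small sketch that turns the
hourly read counts quoted just above into per-node shares of each DC's total.
The class and method names are made up for illustration, and nothing here
talks to Cassandra or JMX; it is only arithmetic on the figures in this
message.

```java
import java.util.Locale;

// Illustrative arithmetic only: the class and method names are hypothetical,
// and the inputs are the hourly read counts (in thousands) quoted above.
final class ReadShareSketch {

    // Returns each node's share of the data center's total reads, in percent.
    static double[] sharePercent(double[] reads) {
        double total = 0.0;
        for (double r : reads) {
            total += r;
        }
        double[] share = new double[reads.length];
        for (int i = 0; i < reads.length; i++) {
            share[i] = 100.0 * reads[i] / total;
        }
        return share;
    }

    // Ratio of the busiest node's reads to the least busy node's reads.
    static double maxOverMin(double[] reads) {
        double max = reads[0];
        double min = reads[0];
        for (double r : reads) {
            max = Math.max(max, r);
            min = Math.min(min, r);
        }
        return max / min;
    }

    public static void main(String[] args) {
        double[] dc1 = {1.79, 1.92, 1.97};
        double[] dc2 = {2.90, 1.76, 1.19};
        print("DC1", dc1);
        print("DC2", dc2);
    }

    private static void print(String name, double[] reads) {
        double[] share = sharePercent(reads);
        for (int i = 0; i < share.length; i++) {
            System.out.println(String.format(Locale.ROOT,
                    "%s node %d: %.1f%% of reads", name, i + 1, share[i]));
        }
        System.out.println(String.format(Locale.ROOT,
                "%s busiest/least busy: %.2f", name, maxOverMin(reads)));
    }
}
```

With those inputs, Node 1 in DC2 comes out at roughly half of that DC's reads
and Node 3 at about a fifth, while each DC1 node is close to a third.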
>
> I ran a nodetool repair yesterday. It ran for 6 hours and, when completed,
> didn’t change the read balance.
>
> Write levels are similarly unbalanced on DC2, but not as bad as reads.
>
> Does anyone have any suggestions on how to rebalance? I’m thinking maybe
> running a nodetool cleanup in case some of the keys have shifted?
>
> Regards
>
> Stephen Walsh
>
> This email (including any attachments) is proprietary to Aspect Software,
> Inc. and may contain information that is confidential. If you have received
> this message in error, please do not read, copy or forward this message.
> Please notify the sender immediately, delete it from your system and
> destroy any copies. You may not further disclose or distribute this email
> or its attachments.

--001a11440f82e1cdf60526153ce7
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Thanks for the elaboration. A few more questions...<div><b=
r></div><div>Is there only a single thread in each client or are there mult=
iple threads doing reading in parallel? IOW, does a read need to complete b=
efore the next read is issued.</div><div><br></div><div>What client Cassand=
ra driver are you using? Java?</div><div><br></div><div>What does your conn=
ection code look like, say compared to the example in the doc:</div><div><a=
 href=3D"http://docs.datastax.com/en/developer/java-driver/2.0/java-driver/=
quick_start/qsSimpleClientCreate_t.html">http://docs.datastax.com/en/develo=
per/java-driver/2.0/java-driver/quick_start/qsSimpleClientCreate_t.html</a>=
</div><div><br></div><div>Just to make sure it really is connecting only to=
 the local cluster and using round robin and whether it is token aware.<br>=
<div><br></div></div></div><div class=3D"gmail_extra"><br clear=3D"all"><di=
v><div class=3D"gmail_signature"><div dir=3D"ltr">-- Jack Krupansky</div></=
div></div>
<br><div class=3D"gmail_quote">On Fri, Dec 4, 2015 at 10:51 AM, Walsh, Step=
hen <span dir=3D"ltr">&lt;<a href=3D"mailto:Stephen.Walsh@aspect.com" targe=
t=3D"_blank">Stephen.Walsh@aspect.com</a>&gt;</span> wrote:<br><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid=
;padding-left:1ex">


<div lang=3D"EN-IE" link=3D"blue" vlink=3D"purple">
<div>
<p class=3D"MsoNormal">Thanks for your input, but I think I=E2=80=99ve alre=
ady answered most of your questions.<u></u><u></u></p><span class=3D"">
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal">How many clients do you have performing reads?<u></u=
><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">------------------<u></u><u></=
u></span></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">On Wed, Dec 2, 2015 at 6:44 PM, Walsh=
, Stephen &lt;<a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank"=
><span style=3D"color:#1f497d;text-decoration:none">Stephen.Walsh@aspect.co=
m</span></a>&gt;
 wrote<u></u><u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">=E2=80=A6.<u></u><u></u></span=
></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">There are 2 application (1 for each D=
C) who read and write at the same rate to their local DC<u></u><u></u></spa=
n></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">=E2=80=A6.<u></u><u></u></span=
></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">--------------------<u></u><u></u></s=
pan></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal">Is your load balancer in front of your clients or be=
tween your clients and Cassandra?<u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">------------------<u></u><u></=
u></span></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">On Thu, Dec 3, 2015 at 4:58 AM, Walsh=
, Stephen &lt;<a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank"=
><span style=3D"color:#1f497d;text-decoration:none">Stephen.Walsh@aspect.co=
m</span></a>&gt;
 wrote:<u></u><u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">=E2=80=A6<u></u><u></u></span>=
</p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">our production applications are behin=
d a round robin load balancer<u></u><u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">=E2=80=A6<u></u><u></u></span>=
</p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">------------------<u></u><u></u></spa=
n></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">No Load Balancers talk to cassandra =
=E2=80=93 I=E2=80=99m only mentioning this to show that the writes / read a=
re evenly distributed over the 2 DC=E2=80=99s<u></u><u></u></span></p><span=
 class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal">Does Node1 of DC2 have the exact same configuration =
of hardware of the other nodes<span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d"><u></u><u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">Yes<u></u><u></u></span></p><s=
pan class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal">Is it in the same rack<u></u><u></u></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">It=E2=80=99s in AWS =E2=80=93 =
but we have it configured via the GossipProperytFileSnitch that they are al=
l on unique racks<u></u><u></u></span></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal">Maybe your load balancer thinks that node is more ca=
pable and handles requests faster so that it looks less loaded than the oth=
er two nodes<u></u><u></u></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">Unlikely, it=E2=80=99s all TCP=
 SSL pass though connections. It doesn=E2=80=99t balance on load, it just r=
ound robins each request<u></u><u></u></span></p><span class=3D"">
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal">You might also check the read counts after a very sh=
ort interval of time to see if Node1 is uniformly getting more requests or =
just occasionally<u></u><u></u></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">------------------<u></u><u></=
u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">On Wed, Dec 2, 2015 at 3:36 PM, Walsh=
, Stephen &lt;<a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank"=
><span style=3D"color:#1f497d;text-decoration:none">Stephen.Walsh@aspect.co=
m</span></a>&gt;
 wrote =E2=80=A6<u></u><u></u></span></p><span class=3D"">
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">We monitor the number of reads / writ=
es of every table via the cassandra JMX metrics. (cassandra.db.read_count)<=
u></u><u></u></span></p>
</span><p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&=
quot;Calibri&quot;,sans-serif;color:#1f497d">=E2=80=A6<u></u><u></u></span>=
</p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">------------------<u></u><u></u></spa=
n></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">We can only monitor in 1 hour moving =
window<u></u><u></u></span></p><span class=3D"">
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal">Maybe the other two nodes are in a different rack th=
at occasionally has net connectivity issues<u></u><u></u></p>
</span><p class=3D"MsoNormal">Unlikely seems its AWS<span style=3D"font-siz=
e:11.0pt;font-family:&quot;Calibri&quot;,sans-serif;color:#1f497d"><u></u><=
u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d"><u></u>=C2=A0<u></u></span></p>
<p class=3D"MsoNormal"><b><span lang=3D"EN-US" style=3D"font-size:11.0pt;fo=
nt-family:&quot;Calibri&quot;,sans-serif">From:</span></b><span lang=3D"EN-=
US" style=3D"font-size:11.0pt;font-family:&quot;Calibri&quot;,sans-serif"> =
Jack Krupansky [mailto:<a href=3D"mailto:jack.krupansky@gmail.com" target=
=3D"_blank">jack.krupansky@gmail.com</a>]
<br>
<b>Sent:</b> 03 December 2015 16:11</span></p><div><div class=3D"h5"><br>
<b>To:</b> <a href=3D"mailto:user@cassandra.apache.org" target=3D"_blank">u=
ser@cassandra.apache.org</a><br>
<b>Subject:</b> Re: cassandra reads are unbalanced<u></u><u></u></div></div=
><p></p><div><div class=3D"h5">
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<div>
<p class=3D"MsoNormal">How many clients do you have performing reads?<u></u=
><u></u></p>
<div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
</div>
<div>
<p class=3D"MsoNormal">Is your load balancer in front of your clients or be=
tween your clients and Cassandra?<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
</div>
<div>
<p class=3D"MsoNormal">Does Node1 of DC2 have the exact same configuration =
of hardware of the other nodes? Is it in the same rack? Maybe your load bal=
ancer thinks that node is more capable and handles requests faster so that =
it looks less loaded than the other
 two nodes.<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
</div>
<div>
<p class=3D"MsoNormal">You might also check the read counts after a very sh=
ort interval of time to see if Node1 is uniformly getting more requests or =
just occasionally. Maybe the other two nodes are in a different rack that o=
ccasionally has net connectivity issues
 so that the requests get diverted by the client/load balancer to Node1 dur=
ing those times.<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
</div>
</div>
<div>
<p class=3D"MsoNormal"><br clear=3D"all">
<u></u><u></u></p>
<div>
<div>
<div>
<p class=3D"MsoNormal">-- Jack Krupansky<u></u><u></u></p>
</div>
</div>
</div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
<div>
<p class=3D"MsoNormal">On Thu, Dec 3, 2015 at 4:58 AM, Walsh, Stephen &lt;<=
a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank">Stephen.Walsh@=
aspect.com</a>&gt; wrote:<u></u><u></u></p>
<blockquote style=3D"border:none;border-left:solid #cccccc 1.0pt;padding:0c=
m 0cm 0cm 6.0pt;margin-left:4.8pt;margin-right:0cm">
<div>
<div>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">Thanks but keep in mind that both DC =
should be getting the same load, our production applications are
 behind a round robin load balancer =E2=80=93 so each one our local applica=
tion talk to its local Cassandra DataCenter.</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">It took about 4 hours but the nodetoo=
l cleanup eventually balanced all nodes</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><b><span lang=3D"EN-US" style=3D"font-size:11.0pt;fo=
nt-family:&quot;Calibri&quot;,sans-serif">From:</span></b><span lang=3D"EN-=
US" style=3D"font-size:11.0pt;font-family:&quot;Calibri&quot;,sans-serif"> =
DuyHai
 Doan [mailto:<a href=3D"mailto:doanduyhai@gmail.com" target=3D"_blank">doa=
nduyhai@gmail.com</a>]
<br>
<b>Sent:</b> 02 December 2015 16:27</span><u></u><u></u></p>
<div>
<div>
<p class=3D"MsoNormal"><br>
<b>To:</b> <a href=3D"mailto:user@cassandra.apache.org" target=3D"_blank">u=
ser@cassandra.apache.org</a><br>
<b>Subject:</b> Re: cassandra reads are unbalanced<u></u><u></u></p>
</div>
</div>
<div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<div>
<div>
<p class=3D"MsoNormal">If you&#39;re using the Java driver with LOCAL_ONE a=
nd the default load balancing strategy (TokenAware wrapped on=C2=A0DCAwareR=
oundRobin), the driver=C2=A0will always select the primary replica.
 To change this behavior and introduce some randomness so that non primary =
replicas get a chance to serve a read:<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">new TokenAwarePolicy(new DCAwareRoundRobinPolicy(&qu=
ot;local_DC&quot;), true).<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">The second parameter (true) asks the TokenAware poli=
cy to &quot;shuffle&quot; replica on each request to avoid always returning=
 the primary replica.<u></u><u></u></p>
</div>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<div>
<p class=3D"MsoNormal">On Wed, Dec 2, 2015 at 6:44 PM, Walsh, Stephen &lt;<=
a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank">Stephen.Walsh@=
aspect.com</a>&gt; wrote:<u></u><u></u></p>
<blockquote style=3D"border:none;border-left:solid #cccccc 1.0pt;padding:0c=
m 0cm 0cm 6.0pt;margin-left:4.8pt;margin-top:5.0pt;margin-right:0cm;margin-=
bottom:5.0pt">
<div>
<div>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">Very good questions.</span><u></u><u>=
</u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">We have reads and writes at LOCAL_ONE=
.</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">There are 2 application (1 for each D=
C) who read and write at the same rate to their local DC</span><u></u><u></=
u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">(All reads / writes started all perfe=
ctly even and degraded over time)</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">We use DCAwareRoundRobin policy</span=
><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">On update on the nodetool cleanup =E2=
=80=93 it has help but hasn=E2=80=99t balanced all nodes. Node 1 on DC2 is =
still
 quite high</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span style=3D"font-size:11.0pt;font-family:&quot;Ca=
libri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 1 (DC1) =C2=A0=3D =C2=A01.35k=
=C2=A0=C2=A0=C2=A0 (seeder)</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 2 (DC1) =C2=A0=3D =C2=A01.54k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 3 (DC1) =C2=A0=3D =C2=A01.45k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 1 (DC2) =C2=A0=3D =C2=A02.06k=
=C2=A0=C2=A0 (seeder)</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 2 (DC2) =C2=A0=3D =C2=A01.38k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 3 (DC2) =C2=A0=3D =C2=A01.43k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE" style=3D"font-size:11.0pt;font-fam=
ily:&quot;Calibri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></=
u></p>
<p class=3D"MsoNormal"><span lang=3D"DE" style=3D"font-size:11.0pt;font-fam=
ily:&quot;Calibri&quot;,sans-serif;color:#1f497d">=C2=A0</span><u></u><u></=
u></p>
<p class=3D"MsoNormal"><b><span lang=3D"EN-US" style=3D"font-size:11.0pt;fo=
nt-family:&quot;Calibri&quot;,sans-serif">From:</span></b><span lang=3D"EN-=
US" style=3D"font-size:11.0pt;font-family:&quot;Calibri&quot;,sans-serif"> =
DuyHai
 Doan [mailto:<a href=3D"mailto:doanduyhai@gmail.com" target=3D"_blank">doa=
nduyhai@gmail.com</a>]
<br>
<b>Sent:</b> 02 December 2015 14:22<br>
<b>To:</b> <a href=3D"mailto:user@cassandra.apache.org" target=3D"_blank">u=
ser@cassandra.apache.org</a><br>
<b>Subject:</b> Re: cassandra reads are unbalanced</span><u></u><u></u></p>
<div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<div>
<p class=3D"MsoNormal" style=3D"margin-bottom:12.0pt">Which consistency lev=
el do you use for reads? ONE? Are you reading from only DC1 or from both DC=
s?<u></u><u></u></p>
<div>
<p class=3D"MsoNormal">What is the LoadBalancingPolicy you have configured =
for your driver? TokenAware wrapped around DCAwareRoundRobin?<u></u><u></u>=
</p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
</div>
<div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<div>
<p class=3D"MsoNormal">On Wed, Dec 2, 2015 at 3:36 PM, Walsh, Stephen &lt;<=
a href=3D"mailto:Stephen.Walsh@aspect.com" target=3D"_blank">Stephen.Walsh@=
aspect.com</a>&gt; wrote:<u></u><u></u></p>
<blockquote style=3D"border:none;border-left:solid #cccccc 1.0pt;padding:0c=
m 0cm 0cm 6.0pt;margin-left:4.8pt;margin-top:5.0pt;margin-right:0cm;margin-=
bottom:5.0pt">
<div>
<div>
<p class=3D"MsoNormal">Hey all,<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Thanks for taking the time to help.<u></u><u></u></p=
>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">So we have 6 cassandra nodes in 2 Data Centers.<u></=
u><u></u></p>
<p class=3D"MsoNormal">Both Data Centers have a replication of 3 =E2=80=93 =
so all nodes have all the data.<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Over the last 2 days we=E2=80=99ve noticed that data=
 reads / writes have shifted from balanced to unbalanced<u></u><u></u></p>
<p class=3D"MsoNormal">(Nodetool status still shows 100% ownership on every=
 node, with similar sizes)<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">For Example<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">We monitor the number of reads / writes of every tab=
le via the cassandra JMX metrics. (cassandra.db.read_count)<u></u><u></u></=
p>
<p class=3D"MsoNormal">Over the last hour of this run<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Reads<u></u><u></u></p>
<p class=3D"MsoNormal">Node 1 (DC1) =C2=A0=3D =C2=A01.79k=C2=A0=C2=A0=C2=A0=
 (seeder)<u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 2 (DC1) =C2=A0=3D =C2=A01.92k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 3 (DC1) =C2=A0=3D =C2=A01.97k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 1 (DC2) =C2=A0=3D =C2=A02.90k=
=C2=A0=C2=A0
</span>(seeder)<u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 2 (DC2) =C2=A0=3D =C2=A01.76k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">Node 3 (DC2) =C2=A0=3D =C2=A01.19k=
</span><u></u><u></u></p>
<p class=3D"MsoNormal"><span lang=3D"DE">=C2=A0</span><u></u><u></u></p>
<p class=3D"MsoNormal">As you can see, on DC1 everything is pretty well bal=
anced, but on DC2 the reads favour Node 1 over Node 3.<u></u><u></u></p>
<p class=3D"MsoNormal">I ran a nodetool repair yesterday =E2=80=93 it ran f=
or 6 hours and, once completed, didn=E2=80=99t change the read balance.<u><=
/u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Write levels are similarly skewed on DC2, but not as=
 bad as reads.<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Does anyone have suggestions on how to rebalance? I=
=E2=80=99m thinking of running a nodetool cleanup in case some of the keys =
have shifted.<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">Regards<u></u><u></u></p>
<p class=3D"MsoNormal">Stephen Walsh<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
<p class=3D"MsoNormal">This email (including any attachments) is proprietar=
y to Aspect Software, Inc. and may contain information that is confidential=
. If you have received this message in error, please
 do not read, copy or forward this message. Please notify the sender immedi=
ately, delete it from your system and destroy any copies. You may not furth=
er disclose or distribute this email or its attachments.
<u></u><u></u></p>
</div>
</blockquote>
</div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
</div>
</div>
</div>
<div>
<div>
</div>
</div>
</div>
</blockquote>
</div>
<p class=3D"MsoNormal">=C2=A0<u></u><u></u></p>
</div>
</div>
</div>
</div>
<div>
<div>
</div>
</div>
</div>
</blockquote>
</div>
<p class=3D"MsoNormal"><u></u>=C2=A0<u></u></p>
</div>
</div></div></div><div><div class=3D"h5">
</div></div></div>

</blockquote></div><br></div>

--001a11440f82e1cdf60526153ce7--