From: Ikeda Anthony
Subject: Re: Local Quorum Performance...
Date: Sat, 17 Sep 2011 21:23:12 -0700
To: user@cassandra.apache.org

I'm not sure if it's significant, but at first glance the IP addresses all have the same octets under the PropertyFileSnitch, yet under the EC2Snitch all the octets are different. That is:

PropertyFileSnitch reports that the nodes are all in the same data centre [168] and the same rack [2].
EC2Snitch reports that the nodes are in 3 different data centres [20, 73, 236].

I'm still new at this too and may not have the full answer, as we are still prepping our prod environment with the PropertyFileSnitch (2 DCs, 3 nodes per DC). Our QA environment is configured much the same way, only it's 3 nodes in a single DC:

consistency: LOCAL_QUORUM
strategy: NetworkTopologyStrategy
strategy_options: datacenter1:3

With that, the data is distributed equally, about 33% per node.
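For reference, a keyspace matching that setup would be created along these lines in cassandra-cli. This is just a sketch: the keyspace name is made up, and the exact strategy_options syntax differs a little between Cassandra versions:

create keyspace QAKeyspace
  with placement_strategy = 'org.apache.cassandra.locator.NetworkTopologyStrategy'
  and strategy_options = {datacenter1:3};

For a two-DC cluster the same statement would carry one replica count per data centre, e.g. strategy_options = {us-east:2, us-west:2}.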
Just reading the docs on the DataStax website, I'm starting to wonder how the PropertyFileSnitch distributes the data across the DCs:

  "For NetworkTopologyStrategy, it specifies the number of replicas per data center in a comma separated list of datacenter_name:number_of_replicas."

I'm wondering if you need to increase your replication factor to 3 to see the data replicate across the DCs.
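For what it's worth, my understanding (happy to be corrected) is that the PropertyFileSnitch itself doesn't place any data; it only tells Cassandra which data centre and rack each node belongs to via conf/cassandra-topology.properties, and NetworkTopologyStrategy then combines those labels with the strategy_options counts to place replicas. Using the addresses from your ring output below, that file would look something like this (illustrative only):

# conf/cassandra-topology.properties, read by PropertyFileSnitch
192.168.2.1=us-east:1b
192.168.2.2=us-east:1b
192.168.2.6=us-west:1c
192.168.2.7=us-west:1c
# fallback for any node not listed above
default=us-east:1b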
Anthony

On 17/09/2011, at 8:36 PM, Chris Marino wrote:

> Anthony, we used the Ec2Snitch for one set of runs, but for another set we're using the PropertyFileSnitch.
>
> With the PropertyFileSnitch we see:
>
> Address          DC       Rack  Status  State   Load      Owns    Token
>                                                                   85070591730234615865843651857942052865
> 192.168.2.1      us-east  1b    Up      Normal  60.59 MB  50.00%  0
> 192.168.2.6      us-west  1c    Up      Normal  26.5 MB   0.00%   1
> 192.168.2.2      us-east  1b    Up      Normal  29.86 MB  50.00%  85070591730234615865843651857942052864
> 192.168.2.7      us-west  1c    Up      Normal  60.63 MB  0.00%   85070591730234615865843651857942052865
>
> While with the EC2Snitch we see:
>
> Address          DC       Rack  Status  State   Load      Owns    Token
>                                                                   85070591730234615865843651857942052865
> 107.20.68.176    us-east  1b    Up      Normal  59.95 MB  50.00%  0
> 204.236.179.193  us-west  1c    Up      Normal  53.67 MB  0.00%   1
> 184.73.133.171   us-east  1b    Up      Normal  60.65 MB  50.00%  85070591730234615865843651857942052864
> 204.236.166.4    us-west  1c    Up      Normal  26.33 MB  0.00%   85070591730234615865843651857942052865
>
> What's also strange is that the load on the nodes changes as well. For example, node 204.236.166.4 is sometimes very low (~26 KB), other times it's closer to 30 MB. We see the same kind of variability in both clusters.
>
> For both clusters, we're running stress tests with the following options:
>
> --consistency-level=LOCAL_QUORUM --threads=4 --replication-strategy=NetworkTopologyStrategy --strategy-properties=us-east:2,us-west:2 --column-size=128 --keep-going --num-keys=100000 -r
>
> Any clues to what is going on here are greatly appreciated.
>
> Thanks
> CM
>
> On Sat, Sep 17, 2011 at 12:15 PM, Ikeda Anthony wrote:
> What snitch do you have configured? We typically see a proper spread of data across all our nodes equally.
>
> Anthony
>
> On 17/09/2011, at 10:06 AM, Chris Marino wrote:
>
>> Hi, I have a question about what to expect when running a cluster across data centers with Local Quorum consistency.
>>
>> My simplistic assumption is that an 8 node cluster split across 2 data centers and running with local quorum would perform roughly the same as a 4 node cluster in one data center.
>>
>> I'm 95% certain we've set up the keyspace so that the entire range is in one data center and the client is local. I see the keyspace split across all the local nodes, with the remote nodes owning 0%. Yet when I run the stress tests against this configuration with local quorum, I see dramatically different results from when I ran the same tests against a 4 node cluster. I'm still 5% unsure of this because the documentation on how to configure this is pretty thin.
>>
>> My understanding of Local Quorum was that once the data was written to a local quorum, the commit would complete. I also believed that this would eliminate any WAN latency required for replication to the other DC.
>>
>> It's not just that the split cluster runs slower; there is also enormous variability in identical tests, sometimes by a factor of 2 or more. It seems as though the WAN latency is not only impacting performance but also introducing a wide variation in overall performance.
>>
>> Should WAN latency be completely hidden with local quorum? Or are there second order issues involved that will impact performance?
>>
>> I'm running in EC2 across the us-east/us-west regions. I already know how unpredictable EC2 performance can be, but what I'm seeing here is far beyond normal performance variability for EC2.
>>
>> Is there something obvious that I'm missing that would explain why the results are so different?
>>
>> Here's the config when we run a 2x2 cluster:
>>
>> Address       DC       Rack  Status  State   Load      Owns    Token
>>                                                                 85070591730234615865843651857942052865
>> 192.168.2.1   us-east  1b    Up      Normal  25.26 MB  50.00%  0
>> 192.168.2.6   us-west  1c    Up      Normal  12.68 MB  0.00%   1
>> 192.168.2.2   us-east  1b    Up      Normal  12.56 MB  50.00%  85070591730234615865843651857942052864
>> 192.168.2.7   us-west  1c    Up      Normal  25.48 MB  0.00%   85070591730234615865843651857942052865
>>
>> Thanks in advance.
>> CM
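P.S. On the LOCAL_QUORUM question above, my understanding (worth double-checking against the docs) is that the coordinator only waits for a quorum of replicas in its own data centre; the writes for the remote DC are still sent, but asynchronously, so they shouldn't add WAN latency to the request itself. The quorum arithmetic, roughly:

local quorum = floor(replicas in local DC / 2) + 1
us-east:2  ->  floor(2 / 2) + 1 = 2   (both local replicas must ack each write)
us-east:3  ->  floor(3 / 2) + 1 = 2   (any 2 of the 3 local replicas)

If that's right, then with us-east:2 every write waits on both local replicas, so a single slow node in the local DC shows up directly in the numbers even before any WAN effects.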