Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of arodrime@gmail.com designates
 209.85.217.180 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAKkz8Q3HALLGAhS8qs5TLQHtva0coqEDXHrtxKPo=8D46Nx3Ew@mail.gmail.com>
References: 
 <CAGiE6h-u2g9V=dbQvCGVipRSsC6REwago=vPzScdEjRaLsJ=2w@mail.gmail.com>
 <CAP7WDFUFqQLWpc=m6hVy5OtMAp5PgmTQ0qiQwJ8S8b+zjFT-RA@mail.gmail.com>
 <CAGiE6h8tyefdqCRJjyLyPNKEk0J+64F=TogNo6MSdg3VHvHd8w@mail.gmail.com>
 <CA+VSrLoLAh0jX74dsCgBo1=OeZSegQ_BaX9c3qcZOPGOfqbxrA@mail.gmail.com>
 <CAKkz8Q1ZKX1UkCn6bSwhSagegynVPU9oxB5YNemeH-=thEvovA@mail.gmail.com>
 <CA+VSrLqUP_gPZdSqYMX-ZgnPZ7osCYKXZZBkDHSJuYs8Aac9ow@mail.gmail.com>
 <CAKkz8Q3HALLGAhS8qs5TLQHtva0coqEDXHrtxKPo=8D46Nx3Ew@mail.gmail.com>
From: Alain RODRIGUEZ <arodrime@gmail.com>
Date: Wed, 12 Jun 2013 16:53:51 +0200
Message-ID: 
 <CA+VSrLqBnFEqmd2F-Y9DdkR48eYL68pNfwpOEZ0KkBMqzPitog@mail.gmail.com>
Subject: Re: Multiple data center performance
To: user@cassandra.apache.org
Cc: comomore@gmail.com
Content-Type: multipart/alternative; boundary=089e0158c3d4c162d804def62f08

--089e0158c3d4c162d804def62f08
Content-Type: text/plain; charset=ISO-8859-1

Crystal clear. We use a lot of counters, and I am always happy to learn
this kind of thing.

Thanks a lot.

Alain


2013/6/12 Sylvain Lebresne <sylvain@datastax.com>

>
> Is there anything special of this kind regarding counters across multiple DCs?
>>
>
> No. Counters behave exactly like other writes as far as the consistency
> level is concerned.
> Technically, the counter write path differs from the normal write path in
> that a counter write is written to one replica first and then to the rest
> of the replicas in a second step (with a local read on the first replica
> in between, which is why counter writes are slower than normal ones).
> But outside of the obvious performance impact, this has no effect on the
> behavior observed from a client point of view. In particular, the
> consistency level has the exact same meaning (though one small difference
> is that counters don't support CL.ANY).
>
> --
> Sylvain
>
>
>>
>> Thank you anyway Sylvain
>>
>>
>> 2013/6/12 Sylvain Lebresne <sylvain@datastax.com>
>>
>>> It is the normal behavior, but that's true of any update, not only of
>>> counters.
>>>
>>> The consistency level does *not* influence which replicas are written to.
>>> Cassandra always writes to all replicas. The consistency level only decides
>>> how many replica acknowledgements are waited for.
>>>
>>> --
>>> Sylvain
>>>
>>>
>>> On Wed, Jun 12, 2013 at 4:56 AM, Alain RODRIGUEZ <arodrime@gmail.com> wrote:
>>>
>>>> "counter will replicate to all replicas during write regardless the
>>>> consistency level"
>>>>
>>>> Is that the normal behavior or a bug?
>>>>
>>>>
>>>> 2013/6/11 Daning Wang <daning@netseer.com>
>>>>
>>>>> It was the counters that caused the problem. Counters replicate to all
>>>>> replicas during a write regardless of the consistency level.
>>>>>
>>>>> In our case, we don't need to sync the counters across data centers, so
>>>>> moving the counters to a new keyspace with all replicas in one data
>>>>> center solved the problem.
>>>>>
>>>>> There is a replicate_on_write option on the table. Turning it off for
>>>>> counters might give better performance, but you run a high risk of losing
>>>>> data and creating inconsistency. I did not try this option.
>>>>>
>>>>> Daning
>>>>>
>>>>>
>>>>> On Sat, Jun 8, 2013 at 6:53 AM, srmore <comomore@gmail.com> wrote:
>>>>>
>>>>>> I am seeing similar behavior. In my case I have 2 nodes in each
>>>>>> datacenter, and one node always has high latency (equal to the latency
>>>>>> between the two datacenters). When one of the datacenters is shut down,
>>>>>> the latency drops.
>>>>>>
>>>>>> I am curious to know whether anyone else has had these issues and, if
>>>>>> so, how they got around them.
>>>>>>
>>>>>> Thanks !
>>>>>>
>>>>>>
>>>>>> On Fri, Jun 7, 2013 at 11:49 PM, Daning Wang <daning@netseer.com> wrote:
>>>>>>
>>>>>>> We have deployed across multiple data centers but hit a performance
>>>>>>> issue. When the nodes in the other data center are up, the read response
>>>>>>> time from clients is 4 or 5 times higher. When we take those nodes down,
>>>>>>> the response time returns to normal (compared to the time before we
>>>>>>> changed to multi-center).
>>>>>>>
>>>>>>> We have high volume on the cluster, and the consistency level is ONE for
>>>>>>> reads, so my understanding is that most of the traffic between data
>>>>>>> centers should be read repair. But that does not seem like it could
>>>>>>> create this much delay.
>>>>>>>
>>>>>>> What could cause the problem? How can we debug this?
>>>>>>>
>>>>>>> Here is the keyspace,
>>>>>>>
>>>>>>> [default@dsat] describe dsat;
>>>>>>> Keyspace: dsat:
>>>>>>>   Replication Strategy: org.apache.cassandra.locator.NetworkTopologyStrategy
>>>>>>>   Durable Writes: true
>>>>>>>     Options: [dc2:1, dc1:3]
>>>>>>>   Column Families:
>>>>>>>     ColumnFamily: categorization_cache
>>>>>>>
>>>>>>>
>>>>>>> Ring
>>>>>>>
>>>>>>> Datacenter: dc1
>>>>>>> ===============
>>>>>>> Status=Up/Down
>>>>>>> |/ State=Normal/Leaving/Joining/Moving
>>>>>>> --  Address        Load      Tokens  Owns (effective)  Host ID                               Rack
>>>>>>> UN  xx.xx.xx..111  59.2 GB   256     37.5%             4d6ed8d6-870d-4963-8844-08268607757e  rac1
>>>>>>> DN  xx.xx.xx..121  99.63 GB  256     37.5%             9d0d56ce-baf6-4440-a233-ad6f1d564602  rac1
>>>>>>> UN  xx.xx.xx..120  66.32 GB  256     37.5%             0fd912fb-3187-462b-8c8a-7d223751b649  rac1
>>>>>>> UN  xx.xx.xx..118  63.61 GB  256     37.5%             3c6e6862-ab14-4a8c-9593-49631645349d  rac1
>>>>>>> UN  xx.xx.xx..117  68.16 GB  256     37.5%             ee6cdf23-d5e4-4998-a2db-f6c0ce41035a  rac1
>>>>>>> UN  xx.xx.xx..116  32.41 GB  256     37.5%             f783eeef-1c51-4f91-ab7c-a60669816770  rac1
>>>>>>> UN  xx.xx.xx..115  64.24 GB  256     37.5%             e75105fb-b330-4f40-aa4f-8e6e11838e37  rac1
>>>>>>> UN  xx.xx.xx..112  61.32 GB  256     37.5%             2547ee54-88dd-4994-a1ad-d9ba367ed11f  rac1
>>>>>>> Datacenter: dc2
>>>>>>> ===============
>>>>>>> Status=Up/Down
>>>>>>> |/ State=Normal/Leaving/Joining/Moving
>>>>>>> --  Address        Load      Tokens  Owns (effective)  Host ID                               Rack
>>>>>>> DN  xx.xx.xx.199   58.39 GB  256     50.0%             6954754a-e9df-4b3c-aca7-146b938515d8  rac1
>>>>>>> DN  xx.xx.xx..61   33.79 GB  256     50.0%             91b8d510-966a-4f2d-a666-d7edbe986a1c  rac1
>>>>>>>
>>>>>>>
>>>>>>> Thank you in advance,
>>>>>>>
>>>>>>> Daning
>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>
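
A rough sketch of Sylvain's point, in case it helps: the coordinator sends a
write to every replica, and the consistency level only sets how many
acknowledgements it waits for before answering the client. This is a toy
Python model, not Cassandra code. The replica names, the latencies, and the
helpers required_acks and simulate_write are all made up for illustration
(3 replicas in dc1 and 1 in dc2, as in the keyspace options quoted above),
and it ignores the extra local read on the counter write path.

```python
# Hypothetical round-trip latencies (ms) from the coordinator to each replica.
# Names and numbers are invented; only the 3 + 1 replica layout comes from the
# "[dc2:1, dc1:3]" keyspace options in the thread.
REPLICA_LATENCY_MS = {
    "dc1-a": 1.0,
    "dc1-b": 1.5,
    "dc1-c": 2.0,
    "dc2-a": 80.0,
}


def required_acks(consistency_level, replica_count):
    """Number of replica acknowledgements the coordinator waits for."""
    if consistency_level == "ONE":
        return 1
    if consistency_level == "QUORUM":
        return replica_count // 2 + 1
    if consistency_level == "ALL":
        return replica_count
    raise ValueError("unsupported consistency level: " + consistency_level)


def simulate_write(latencies_ms, consistency_level):
    """Return (replicas_written, client_wait_ms) for one simulated write."""
    # The write is always sent to every replica, whatever the consistency level.
    replicas_written = sorted(latencies_ms)
    acks_needed = required_acks(consistency_level, len(latencies_ms))
    # The client is answered once the acks_needed-th fastest ack has arrived.
    client_wait_ms = sorted(latencies_ms.values())[acks_needed - 1]
    return replicas_written, client_wait_ms


if __name__ == "__main__":
    for level in ("ONE", "QUORUM", "ALL"):
        written, wait_ms = simulate_write(REPLICA_LATENCY_MS, level)
        print(level, "wrote to", written, "client waited", wait_ms, "ms")
```

With these made-up numbers the client waits 1 ms at ONE, 2 ms at QUORUM and
80 ms at ALL, while the write reaches the dc2 replica in every case.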

--089e0158c3d4c162d804def62f08--