Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of ben@instaclustr.com designates
 209.85.220.50 as permitted sender)
From: Ben Bromhead <ben@instaclustr.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_DC216247-5CE2-4461-861D-9AE665131EB2"
Message-Id: <90EDDC17-ABD2-4D2F-9D12-4EE553792C68@instaclustr.com>
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
Subject: Re: in AWS is it worth trying to talk to a server in the same zone as
 your client?
Date: Thu, 13 Feb 2014 08:14:56 +1100
References: 
 <CAF9x2_duYmfwK8g9Ng0RMyTD-c-Ssm499dUrg573G6VjbAvkwg@mail.gmail.com>
 <CAK0tFt4J1TSXneVoZKWiLheH625FdWf3OB14Xu0bJwx2VMCWBg@mail.gmail.com>
 <CAOZF2Bd-UtqPMTt743e71EB1zqDZw443xXdHEcKh=icbiWmj2A@mail.gmail.com>
 <etPan.52fbd3e1.6b8b4567.bbd1@Russells-iMac.local>
 <CAAjbL_mJiHJxVnLbLhkS0SoXQ+yE1e6t+yeNX3Fj1T10yQ-1cg@mail.gmail.com>
 <etPan.52fbd6e0.643c9869.bbd1@Russells-iMac.local>
 <CAK0tFt5kQRSRLRS=QEYbN-hR7vYFpXEqBENXb18a=4mJ9A-q=Q@mail.gmail.com>
To: user@cassandra.apache.org
In-Reply-To: 
 <CAK0tFt5kQRSRLRS=QEYbN-hR7vYFpXEqBENXb18a=4mJ9A-q=Q@mail.gmail.com>


--Apple-Mail=_DC216247-5CE2-4461-861D-9AE665131EB2
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=windows-1252

0.01/G between zones irrespective of IP is correct.

As for your original question, depending on the driver you are using you =
could write a custom co-ordinator node selection policy.

For example if you are using the Datastax driver you would extend =
http://www.datastax.com/drivers/java/2.0/apidocs/com/datastax/driver/core/=
policies/LoadBalancingPolicy.html

=85 and set the distance based on which zone the node is in.

An alternate method would be to define the zones as data centres and =
then you could leverage existing DC aware policies (We've never tried =
this though).=20


Ben Bromhead
Instaclustr | www.instaclustr.com | @instaclustr | +61 415 936 359


On 13/02/2014, at 8:00 AM, Andrey Ilinykh <ailinykh@gmail.com> wrote:

> I think you are mistaken. It is true for the same zone. between zones =
0.01/G
>=20
>=20
> On Wed, Feb 12, 2014 at 12:17 PM, Russell Bradberry =
<rbradberry@gmail.com> wrote:
> Not when using private IP addresses.  That pricing ONLY applies if you =
are using the public interface or EIP/ENI.  If you use the private IP =
addresses there is no cost associated.
>=20
>=20
>=20
> On February 12, 2014 at 3:13:58 PM, William Oberman =
(oberman@civicscience.com) wrote:
>=20
>> Same region, cross zone transfer is $0.01 / GB (see =
http://aws.amazon.com/ec2/pricing/, Data Transfer section).
>>=20
>>=20
>> On Wed, Feb 12, 2014 at 3:04 PM, Russell Bradberry =
<rbradberry@gmail.com> wrote:
>> Cross zone data transfer does not cost any extra money.=20
>>=20
>> LOCAL_QUORUM =3D QUORUM if all 6 servers are located in the same =
logical datacenter. =20
>>=20
>> Ensure your clients are connecting to either the local IP or the AWS =
hostname that is a CNAME to the local ip from within AWS.  If you =
connect to the public IP you will get charged for outbound data =
transfer.
>>=20
>>=20
>>=20
>> On February 12, 2014 at 2:58:07 PM, Yogi Nerella =
(ynerella999@gmail.com) wrote:
>>=20
>>> Also, may be you need to check the read consistency to local_quorum, =
otherwise the servers still try to read the data from all other data =
centers.
>>>=20
>>> I can understand the latency, but I cant understand how it would =
save money?   The amount of data transferred from the AWS server to the =
client should be same no matter where the client is connected?
>>>   =20
>>>=20
>>>=20
>>> On Wed, Feb 12, 2014 at 10:33 AM, Andrey Ilinykh =
<ailinykh@gmail.com> wrote:
>>> yes, sure. Taking data from the same zone will reduce latency and =
save you some money.
>>>=20
>>>=20
>>> On Wed, Feb 12, 2014 at 10:13 AM, Brian Tarbox =
<tarbox@cabotresearch.com> wrote:
>>> We're running a C* cluster with 6 servers spread across the four =
us-east1 zones.
>>>=20
>>> We also spread our clients (hundreds of them) across the four zones.
>>>=20
>>> Currently we give our clients a connection string listing all six =
servers and let C* do its thing.
>>>=20
>>> This is all working just fine...and we're paying a fair bit in AWS =
transfer costs.  There is a suspicion that this transfer cost is driven =
by us passing data around between our C* servers and clients.
>>>=20
>>> Would there be any value to trying to get a client to talk to one of =
the C* servers in its own zone?
>>>=20
>>> I understand (at least partially!) about coordinator nodes and =
replication and know that no matter which server is the coordinator for =
an operation replication may cause bits to get transferred to/from =
servers in other zones.  Having said that...is there a chance that =
trying to encourage a client to initially contact a server in its own =
zone would help?
>>>=20
>>> Thank you,
>>>=20
>>> Brian Tarbox
>>>=20
>>>=20
>>>=20
>>=20
>>=20
>>=20
>>=20
>=20


--Apple-Mail=_DC216247-5CE2-4461-861D-9AE665131EB2
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=windows-1252

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dwindows-1252"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
">0.01/G between zones irrespective of IP is =
correct.<div><br></div><div>As for your original question, depending on =
the driver you are using you could write a custom co-ordinator node =
selection policy.</div><div><br></div><div>For example if you are using =
the Datastax driver you would extend&nbsp;<a =
href=3D"http://www.datastax.com/drivers/java/2.0/apidocs/com/datastax/driv=
er/core/policies/LoadBalancingPolicy.html">http://www.datastax.com/drivers=
/java/2.0/apidocs/com/datastax/driver/core/policies/LoadBalancingPolicy.ht=
ml</a></div><div><br></div><div>=85 and set the distance based on which =
zone the node is in.</div><div><br></div><div>An alternate method would =
be to define the zones as data centres and then you could leverage =
existing DC aware policies (We've never tried this =
though).&nbsp;</div><div><br></div><div><br><div>
<div style=3D"color: rgb(0, 0, 0); font-family: Helvetica;  font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-align: -webkit-auto; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
"><div><div><div>Ben Bromhead</div><div></div></div><div>Instaclustr =
|&nbsp;<a =
href=3D"https://www.instaclustr.com/">www.instaclustr.com</a>&nbsp;|&nbsp;=
<a href=3D"http://twitter.com/instaclustr">@instaclustr</a>&nbsp;| +61 =
415 936 359</div></div><div><br></div></div><br =
class=3D"Apple-interchange-newline"><br =
class=3D"Apple-interchange-newline">
</div>
<br><div><div>On 13/02/2014, at 8:00 AM, Andrey Ilinykh &lt;<a =
href=3D"mailto:ailinykh@gmail.com">ailinykh@gmail.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite"><div dir=3D"ltr">I think you are mistaken. It is true for =
the same zone. between zones 0.01/G</div><div =
class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Wed, Feb 12, =
2014 at 12:17 PM, Russell Bradberry <span dir=3D"ltr">&lt;<a =
href=3D"mailto:rbradberry@gmail.com" =
target=3D"_blank">rbradberry@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex"><div =
style=3D"word-wrap:break-word"><div style=3D"font-family: Helvetica, =
Arial; font-size: 13px; margin: 0px; ">
Not when using private IP addresses. &nbsp;That pricing <i =
style=3D"font-weight:bold">ONLY </i>applies if you are using the public =
interface or EIP/ENI. &nbsp;If you use the private IP addresses there is =
no cost associated.</div><div>
<div class=3D"h5"> <div><br><br><span =
style=3D"font-family:helvetica,arial;font-size:13px"></span><span></span><=
/div> <br><p style=3D"color:#a0a0a8">On February 12, 2014 at 3:13:58 PM, =
William Oberman (<a href=3D"mailto://oberman@civicscience.com" =
target=3D"_blank">oberman@civicscience.com</a>) wrote:</p>
 <blockquote type=3D"cite"><span>


<div dir=3D"ltr">Same region, cross zone transfer is&nbsp;$0.01 / GB
(see&nbsp;<a href=3D"http://aws.amazon.com/ec2/pricing/" =
target=3D"_blank">http://aws.amazon.com/ec2/pricing/</a>, Data Transfer
section).
<div class=3D"gmail_extra"><br>
<br>
<div class=3D"gmail_quote">On Wed, Feb 12, 2014 at 3:04 PM, Russell
Bradberry <span dir=3D"ltr">&lt;<a href=3D"mailto:rbradberry@gmail.com" =
target=3D"_blank">rbradberry@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style=3D"word-wrap:break-word">
<div style=3D"font-family: Helvetica, Arial; font-size: 13px; margin: =
0px; ">
Cross zone data transfer does not cost any extra money.&nbsp;</div>
<div style=3D"font-family: Helvetica, Arial; font-size: 13px; margin: =
0px; ">
<br></div>
<div style=3D"font-family: Helvetica, Arial; font-size: 13px; margin: =
0px; ">
LOCAL_QUORUM =3D QUORUM if all 6 servers are located in the same
logical datacenter. &nbsp;</div>
<div style=3D"font-family: Helvetica, Arial; font-size: 13px; margin: =
0px; ">
<br></div>
<div style=3D"font-family: Helvetica, Arial; font-size: 13px; margin: =
0px; ">
Ensure your clients are connecting to either the local IP or the
AWS hostname that is a CNAME to the local ip from within AWS.
&nbsp;If you connect to the public IP you will get charged for
outbound data transfer.</div>
<div>
<div>
<div><br>
<br></div>
<br><p style=3D"color:#a0a0a8">On February 12, 2014 at 2:58:07 PM, Yogi
Nerella (<a href=3D"mailto://ynerella999@gmail.com" =
target=3D"_blank">ynerella999@gmail.com</a>) wrote:</p>
<blockquote type=3D"cite">
<div>
<div>
<div dir=3D"ltr"><span>Also, may be you need to check the read
consistency to local_quorum, otherwise the servers still try to
read the data from all other data centers.</span>
<div><span><br></span></div>
<div><span>I can understand the latency, but I cant understand how
it would save money? &nbsp; The amount of data transferred from the
AWS server to the client should be same no matter where the client
is connected?</span></div>
<div><span>&nbsp;&nbsp;&nbsp;</span></div>
</div>
<div class=3D"gmail_extra"><span><br>
<br></span>
<div class=3D"gmail_quote"><span>On Wed, Feb 12, 2014 at 10:33 AM,
Andrey Ilinykh <span dir=3D"ltr">&lt;<a href=3D"mailto:ailinykh@gmail.com"=
 target=3D"_blank">ailinykh@gmail.com</a>&gt;</span> wrote:<br></span>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
<div dir=3D"ltr">yes, sure. Taking data from the same zone will
reduce latency and save you some money.</div>
<div>
<div>
<div class=3D"gmail_extra"><br>
<br>
<div class=3D"gmail_quote">On Wed, Feb 12, 2014 at 10:13 AM, Brian
Tarbox <span dir=3D"ltr">&lt;<a href=3D"mailto:tarbox@cabotresearch.com" =
target=3D"_blank">tarbox@cabotresearch.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">
<div dir=3D"ltr">We're running a C* cluster with 6 servers spread
across the four us-east1 zones.
<div><br></div>
<div>We also spread our clients (hundreds of them) across the four
zones.</div>
<div><br></div>
<div>Currently we give our clients a connection string listing all
six servers and let C* do its thing.</div>
<div><br></div>
<div>This is all working just fine...and we're paying a fair bit in
AWS transfer costs. &nbsp;There is a suspicion that this transfer
cost is driven by us passing data around between our C* servers and
clients.</div>
<div><br></div>
<div>Would there be any value to trying to get a client to talk to
one of the C* servers in its own zone?</div>
<div><br></div>
<div>I understand (at least partially!) about coordinator nodes and
replication and know that no matter which server is the coordinator
for an operation replication may cause bits to get transferred
to/from servers in other zones. &nbsp;Having said that...is there a
chance that trying to encourage a client to initially contact a
server in its own zone would help?</div>
<div><br>
Thank you,</div>
<div><br></div>
<div>Brian Tarbox</div>
<div><br></div>
</div>
</blockquote>
</div>
<br></div>
</div>
</div>
</blockquote>
</div>
<br></div>
</div>
</div>
</blockquote>
</div>
</div>
</div>
</blockquote>
</div>
<br>
<br clear=3D"all">
<div><br></div>
<br></div>
</div>


</span></blockquote></div></div></div></blockquote></div><br></div>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_DC216247-5CE2-4461-861D-9AE665131EB2--