Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of
 JEREMIAH.JORDAN@morningstar.com designates 216.228.224.32 as permitted
 sender)
From: Jeremiah Jordan <jeremiah.jordan@morningstar.com>
Mime-Version: 1.0 (Apple Message framework v1251.1)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_9EDE6501-4C3C-45DF-BAAB-CBB5CAE26734"
Subject: Re: Efficiency of Cross Data Center Replication...?
Date: Wed, 16 Nov 2011 20:46:05 -0600
In-Reply-To: 
 <CABH0dRPk=6f-x0D4mtJef_+_CVhAnwx_M+EhoSisLyJbxfaUdQ@mail.gmail.com>
To: user@cassandra.apache.org
References: 
 <CACPZYUGiAsi0krUvHJwjb-C+uRWVJSV_Oypu5V3CZ6gjYurhkA@mail.gmail.com>
 <CALamADLym=_qSyK13V9XGsfRFzPPJjgjOxr1jVsHJmRvueLD=Q@mail.gmail.com>
 <CACPZYUFJi5qJPyGOUGYKZZ9erox1+VmV6HL_OpEyAgOvic4yKQ@mail.gmail.com>
 <CABH0dRPk=6f-x0D4mtJef_+_CVhAnwx_M+EhoSisLyJbxfaUdQ@mail.gmail.com>
Message-Id: <5F189A91-3C9A-4AB5-A278-CBF5F3330550@morningstar.com>


--Apple-Mail=_9EDE6501-4C3C-45DF-BAAB-CBB5CAE26734
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1

Pretty sure data is sent to the coordinating node in DC2 at the same
time it is sent to replicas in DC1, so I would expect replication to
complete within tens of milliseconds on top of the transport time to DC2.

On Nov 16, 2011, at 3:48 PM, ehershey@gmail.com wrote:

> On a related note - assuming there are available resources across the
> board (cpu and memory on every node, low network latency, non-saturated
> nics/circuits/disks), what's a reasonable expectation for timing on
> replication? Sub-second? Less than five seconds?
>
> Ernie
>
> On Wed, Nov 16, 2011 at 4:00 PM, Brian Fleming
> <bigbrianfleming@gmail.com> wrote:
> Great - thanks Jake
>
> B.
>
> On Wed, Nov 16, 2011 at 8:40 PM, Jake Luciani <jakers@gmail.com> wrote:
> the former
>
> On Wed, Nov 16, 2011 at 3:33 PM, Brian Fleming
> <bigbrianfleming@gmail.com> wrote:
>
> Hi All,
>
> I have a question about inter-data centre replication: if you have 2
> Data Centers, each with a local RF of 2 (i.e. total RF of 4) and write to
> a node in DC1, how efficient is the replication to DC2 - i.e. is that
> data:
>  - replicated over to a single node in DC2 once and internally replicated
>  or
>  - replicated explicitly to two separate nodes?
>
> Obviously from a LAN resource utilisation perspective, the former
> would be preferable.
>
> Many thanks,
>
> Brian
>
> --
> http://twitter.com/tjake


--Apple-Mail=_9EDE6501-4C3C-45DF-BAAB-CBB5CAE26734--