2013/5/20 Bryan Talbot <btalbot@aeriagames.com>

Option #3, since it depends on the placement strategy and not the partitioner.
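
To make the distinction concrete, here is a rough sketch (not Cassandra's actual code) that walks the four-node ring from the question under two simplified placement rules. The function names `simple_replicas` and `per_dc_replicas` are made up for illustration, racks are ignored, and the token space is assumed to be the 0 to 100 range from the question. The first rule imitates SimpleStrategy (take the next nodes on the ring regardless of data center); the second imitates NetworkTopologyStrategy (keep walking until each data center has its own replica count).

```python
from bisect import bisect_left

# Ring from the question: token -> (data center, node).
RING = {0: ("DC1", "N1"), 25: ("DC2", "N1"), 50: ("DC1", "N2"), 75: ("DC2", "N2")}
TOKENS = sorted(RING)


def walk(key_token):
    """Yield nodes clockwise, starting at the first node whose token >= key_token."""
    start = bisect_left(TOKENS, key_token) % len(TOKENS)
    for i in range(len(TOKENS)):
        yield RING[TOKENS[(start + i) % len(TOKENS)]]


def simple_replicas(key_token, rf):
    """SimpleStrategy-like rule: the next rf nodes on the ring, ignoring data centers."""
    return list(walk(key_token))[:rf]


def per_dc_replicas(key_token, rf_by_dc):
    """NetworkTopologyStrategy-like rule (racks ignored): fill each DC's quota while walking."""
    need = dict(rf_by_dc)
    chosen = []
    for dc, node in walk(key_token):
        if need.get(dc, 0) > 0:
            chosen.append((dc, node))
            need[dc] -= 1
    return chosen


if __name__ == "__main__":
    for key in (10, 80):
        print(key, simple_replicas(key, 1), per_dc_replicas(key, {"DC1": 1, "DC2": 1}))
```

With one replica total under the SimpleStrategy-like rule, key 10 lands only on DC2:N1, which is the Option #2 layout. With one replica per data center under the other rule, key 10 lands on DC2:N1 and DC1:N2, which is the Option #1 layout. Same tokens, same partitioner, different outcome.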

-Bryan

On Mon, May 20, 2013 at 6:24 AM, Pinak Pani <nishant.has.a.question@gmail.com> wrote:

I just wanted to verify the fact that if I happen to set up a multi data-center Cassandra setup, will each data center have the complete data-set with it?

Say, I have two data-centers, each with two nodes, and a partitioner that ranges from 0 to 100. Initial tokens are assigned this way:

DC1:N1 = 00
DC2:N1 = 25
DC1:N2 = 50
DC2:N2 = 75

where DCX is data center X, NX is node X.

Which one of the following options is true?

Option #1: DC1 and DC2, each will hold the complete dataset with keys bucketed as follows

DC1:N1 = (50, 00] => 50 keys
DC1:N2 = (00, 50] => 50 keys
---- Complete data set mirrored at DC1

DC2:N1 = (75, 25] => 50 keys
DC2:N2 = (25, 75] => 50 keys
---- Complete data set mirrored at DC2

Option #2: DC1 and DC2, each will hold 50% of the data with keys bucketed as follows (much the same way in a single C setup)

DC1:N1 = (75, 00] => 25 keys
DC2:N1 = (00, 25] => 25 keys
DC1:N2 = (25, 50] => 25 keys
DC2:N2 = (50, 75] => 25 keys
---- data is divided into the two data centers.

Thanks,
PP

Sincerely yours,

Apostolis Xekoukoulotakis