Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of mrlalonde@live.ca designates
 65.54.190.144 as permitted sender)
Message-ID: <BAY160-W4230E2447A0CFDD544AFD2BEC90@phx.gbl>
From: Mathieu Lalonde <mrlalonde@live.ca>
To: <user@cassandra.apache.org>
Subject: DataCenters each with their own local data source
Date: Tue, 22 Nov 2011 19:57:09 -0500
Importance: Normal
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0


Hi=2C

I am wondering if Cassandra's features and datacenter awareness can help me=
 with my scalability problems.

Suppose that I have a 10-20 Data centers=2C each with their own local (mass=
ive) source of time series data.=A0 I would like:
- to avoid replication across data centers (this seems doable based on: htt=
p://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Different-Key=
Spaces-for-different-nodes-in-the-same-ring-td5096393.html#a5096568 )
- writes for local data to be done on the local data center (not sure about=
 that one)
- reads from a master data center to any remote data centers (not sure abou=
t that one either)

It sounds like I am trying to use Cassandra in a very different way that it=
 was intended to be used.
Should I simply have a middle-tier that takes care of distributing reads to=
 multiple data centers and treat each data center as its own autonomous clu=
ster?

Thanks!
Matt

 		 	   		  =