incubator-cassandra-user mailing list archives

From Rob Coli <rc...@palominodb.com>
Subject Re: Data migration between clusters
Date Wed, 31 Oct 2012 17:43:03 GMT
On Tue, Oct 30, 2012 at 4:18 AM, 張 睿 <chou_ei@cyberagent.co.jp> wrote:
> Does anyone here know if there is an efficient way to migrate multiple
> cassandra clusters' data
> to a single cassandra cluster without any dataloss.

Yes.

1) Create a schema on the new cluster that is a superset of all
keyspaces and column families from the source clusters.
2) If all source clusters have the same fixed number of nodes, create
a new cluster with that same number of nodes.
3) Run nodetool drain on, and then shut down, all nodes in all
participating clusters.
4) Copy the sstables from the old clusters, making sure that data from
source node [x] ends up on target node [x].
5) Start Cassandra on the new cluster.
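
For step 4, here is a rough Python sketch of how the copy for one node
[x] could be planned before anything is moved. It is not a tested
migration tool. The function names, the directory names, and the
<data_dir>/<keyspace>/<columnfamily>/ layout are assumptions for
illustration, so check them against your Cassandra version. It refuses
to continue if two source files would land on the same target path,
which may happen when different clusters hold sstables with the same
file name for the same column family.

```python
import shutil
from pathlib import Path


def plan_sstable_copy(source_roots, target_root):
    """Plan the copy of every file under each source data dir to target_root.

    source_roots: data directories taken from source node [x] of each
                  old cluster (hypothetical paths, nodes already drained
                  and stopped).
    target_root:  data directory of target node [x].
    Returns a list of (source_file, target_file) pairs and raises
    ValueError if two files would end up at the same target path.
    """
    target_root = Path(target_root)
    plan = {}
    for root in map(Path, source_roots):
        for src in sorted(root.rglob("*")):
            if not src.is_file():
                continue
            # Keep the keyspace/columnfamily sub-path relative to the data dir.
            dst = target_root / src.relative_to(root)
            if dst in plan or dst.exists():
                raise ValueError(f"name collision for {dst}")
            plan[dst] = src
    return [(src, dst) for dst, src in plan.items()]


def copy_sstables(plan):
    """Execute a plan produced by plan_sstable_copy."""
    for src, dst in plan:
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)
```

You would run the planning step once per node index, with Cassandra
stopped on both sides, and only call copy_sstables once the plan comes
back without a collision.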

However, without more details about your old clusters, your new cluster,
and your availability requirements, I can't give you a more useful answer.

Here's some background on bulk loading, including "copy-the-sstables."

http://palominodb.com/blog/2012/09/25/bulk-loading-options-cassandra

=Rob

-- 
=Robert Coli
AIM&GTALK - rcoli@palominodb.com
YAHOO - rcoli.palominob
SKYPE - rcoli_palominodb
