cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonathan Ellis <>
Subject Re: Backing up the data stored in cassandra
Date Thu, 08 Jul 2010 15:45:58 GMT

On Thu, Jul 8, 2010 at 12:50 AM, Dave Viner <> wrote:
> Hi all,
> What is the recommended strategy for backing up the data stored inside
> cassandra?
> I realized that Cass. is a distributed database, and with a decent
> replication factor, backups are "already done" in some sense.  But, as a
> relatively new user, I'm always concerned that the data is only within the
> system and not stored *anywhere* else.
> In an earlier email in the list, the recommendation was:
> Until tickets 193 and 520 are done, the easiest thing is to copy all
> the sstables from the other nodes that have replicas for the ranges it
> is responsible for (e.g. for replication factor of 3 on rack unaware
> partitioner, the nodes before it and the node after it on the right
> would suffice), and then run nodeprobe cleanup to clear out the
> excess.
> Is this still the recommended approach?  If I backed up the files in
> DataDirectories/*, is it possible to restore a node using those files?
> (That is, bring up a new node, copy the backed up files from the crashed
> node onto the new node, then have the new node join the cluster?)
> Thanks
> Dave Viner

Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support

View raw message