cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Coli <>
Subject Re: What is your backup strategy for Cassandra?
Date Thu, 10 Sep 2015 00:34:18 GMT
On Sun, Sep 6, 2015 at 12:32 AM, Gene <> wrote:

> I've seen quite a few blog posts here and there about various back up
> strategies.  I'm wondering if anyone on this list would be willing to share
> theirs.

> Things I'm curious about:
> 1. Data size

Up to hundreds of gigs per node.

> 2. Frequency for full snapshots

Never/always (depends on your perspective).

> 3. Frequency for copying snapshots off of the Cassandra nodes

As SSTables are flushed.

> 4. Do you use the incremental backups feature


> 5. Do you use commitlog archiving


> 6. What method you use to copy data off of the cluster (e.g. NFS, rsync,
> rsync+ssh, etc)

S3 upload.

> 7. Do you compress your backups, if so how soon (e.g. compress backups
> older than N days)

My SSTables are already snappy compressed, so I am skeptical of benefit
from re-compression.

> 8. Do you use any Off the Shelf scripts for your backups (e.g. tablesnap,
> cassandra_snapshotter, etc)


> 9. Do you utilise AWS for your backups, or do you keep it local (or
> offsite on your own hardware)


tl;dr - tablesnap works. There are awkward aspects to its use, but if you
are operating Cassandra in AWS it's probably the best off the shelf
off-node backup.

View raw message