The thing is I will send the data to a massive storage facility (I don't know what's behind the scenes) so I won't be backing up on one machine where I can install Cassandra. Does the sstable loader work just for copying data from a Cassandra cluster to somewhere on a disk where there is no Cassandra instance? If not what is the best way/tool to achieve that?
Stop your current cluster. Start a new cassandra instance on the machine you want to store your data on. Use the sstable loader to load the sstables from all of the current machines into the new machine. Run major compaction a couple times. You will have all of the data on one machine.
On 12/07/2011 10:17 AM, Alexandru Dan Sicoe wrote:
3 node Cassandra 0.8.5 cluster. I've left the system running in production environment for long term testing. I've accumulated about 350GB of data with RF=2. The machines I used for the tests are older and need to be replaced. Because of this I need to export the data to a permanent location. How should I export the data? In order to reduce the storage spac I want to export only the non-replicated data? I mean, just one copy of the data (without the replicas). Is this possible? How?