cassandra-commits mailing list archives

From "Nick Bailey (JIRA)" <>
Subject [jira] [Created] (CASSANDRA-7779) Add option to sstableloader to only stream to the local dc
Date Fri, 15 Aug 2014 17:37:18 GMT
Nick Bailey created CASSANDRA-7779:

             Summary: Add option to sstableloader to only stream to the local dc
                 Key: CASSANDRA-7779
             Project: Cassandra
          Issue Type: Improvement
          Components: Tools
            Reporter: Nick Bailey
             Fix For: 1.2.19, 2.0.10, 2.1.1

This is meant as a potential workaround for CASSANDRA-4756. Because of that ticket, loading
a cluster-wide snapshot via sstableloader can stream an enormous amount of data. In a
three-datacenter cluster with rf=3 in each datacenter, 81 copies of the data would be
streamed. Once we have per-range sstables we can optimize sstableloader to merge data and
stream only one copy, but until then we need a workaround. By streaming only to the local
datacenter, we can load the data locally in each datacenter and have only 9 copies of the
data rather than 81.
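
The arithmetic behind those figures can be sketched as follows. This is only one reading
of the numbers: it assumes each replica's snapshot is loaded separately, that sstableloader
streams each loaded copy to every replica it targets, and that the figure of 9 is counted
per datacenter. The function name is hypothetical.

```python
def copies_streamed(snapshot_copies, target_replicas):
    """Each snapshot copy that is loaded gets streamed to every targeted replica."""
    return snapshot_copies * target_replicas


datacenters = 3
rf_per_dc = 3
all_replicas = datacenters * rf_per_dc  # replicas across the whole cluster

# Cluster-wide load: every replica's snapshot is streamed to every replica.
cluster_wide = copies_streamed(all_replicas, all_replicas)

# Local-only load (assumed reading): within one datacenter, each local
# replica's snapshot is streamed only to that datacenter's replicas.
local_only = copies_streamed(rf_per_dc, rf_per_dc)

print(cluster_wide, local_only)
```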

This could potentially be achieved with the existing sstableloader option to ignore certain
nodes, but with vnodes and topology changes in the cluster, that could require specifying
every node in the cluster as 'ignored' on the command line, which could be problematic. The
proposed option is just a shortcut to avoid that.
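
As a rough illustration of why the ignore-list approach gets unwieldy, the sketch below
builds the value for sstableloader's existing -i/--ignore option from a node-to-datacenter
map. The addresses, datacenter names, data path, and helper name are all hypothetical, and
the map would have to be regenerated whenever the topology changes.

```python
def build_ignore_arg(node_to_dc, local_dc):
    """Return a comma-separated list of every node outside the local datacenter."""
    remote = sorted(addr for addr, dc in node_to_dc.items() if dc != local_dc)
    return ",".join(remote)


# Hypothetical topology; a real cluster could have hundreds of entries.
topology = {
    "10.0.1.1": "dc1",
    "10.0.1.2": "dc1",
    "10.0.2.1": "dc2",
    "10.0.3.1": "dc3",
}

ignore = build_ignore_arg(topology, "dc1")

# -d names the initial contact hosts, -i the nodes to skip when streaming.
command = ["sstableloader", "-d", "10.0.1.1", "-i", ignore, "/path/to/keyspace/table"]
print(" ".join(command))
```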

This message was sent by Atlassian JIRA
