cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Bailey (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CASSANDRA-4756) Bulk loading snapshots creates RF^2 copies of the data
Date Wed, 03 Oct 2012 20:52:07 GMT
Nick Bailey created CASSANDRA-4756:
--------------------------------------

             Summary: Bulk loading snapshots creates RF^2 copies of the data
                 Key: CASSANDRA-4756
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4756
             Project: Cassandra
          Issue Type: Improvement
    Affects Versions: 1.2.0 beta 1
            Reporter: Nick Bailey


Since a cluster snapshot will contain rf copies of each piece of data, bulkloading all of
those snapshots will create rf^2 copies of each piece of data.

Not sure what the solution here is. Ideally we would merge the RF copies of the data before
sending to the cluster. This would solve any inconsistencies that existed when the snapshot
was taken.

A more naive approach of only loading one of the RF copies and assuming there are no inconsistencies
might be an easier goal for the near term though.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message