cassandra-commits mailing list archives

From "Jonathan Ellis (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-4756) Bulk loading snapshots creates RF^2 copies of the data
Date Wed, 10 Oct 2012 17:33:02 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-4756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473389#comment-13473389 ]

Jonathan Ellis commented on CASSANDRA-4756:
-------------------------------------------

Damn.
                
> Bulk loading snapshots creates RF^2 copies of the data
> ------------------------------------------------------
>
>                 Key: CASSANDRA-4756
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4756
>             Project: Cassandra
>          Issue Type: Improvement
>    Affects Versions: 1.2.0 beta 1
>            Reporter: Nick Bailey
>
> Since a cluster snapshot will contain RF copies of each piece of data, bulk loading all
> of those snapshots will create RF^2 copies of each piece of data.
> Not sure what the solution here is. Ideally we would merge the RF copies of the data
> before sending to the cluster. This would solve any inconsistencies that existed when the
> snapshot was taken.
> A more naive approach of only loading one of the RF copies and assuming there are no
> inconsistencies might be an easier goal for the near term though.
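
For illustration, the arithmetic in the description can be sketched with a toy model. This is a minimal Python sketch, not Cassandra code: the ring placement in `replicas_for`, the `(value, timestamp)` cell tuples, and all function names are hypothetical, and the merge step assumes a simplified highest-timestamp-wins rule.

```python
from collections import Counter


def replicas_for(key, num_nodes, rf):
    """Toy placement (assumption): rf consecutive nodes on a ring, rf <= num_nodes."""
    return [(key + i) % num_nodes for i in range(rf)]


def take_snapshots(rows, num_nodes, rf):
    """Build one snapshot per node; each row lands in the snapshots of its rf replicas."""
    snapshots = [dict() for _ in range(num_nodes)]
    for key, cell in rows.items():
        for node in replicas_for(key, num_nodes, rf):
            snapshots[node][key] = cell
    return snapshots


def bulk_load(snapshots, num_nodes, rf):
    """Count copies written per key when every snapshot is streamed to all replicas."""
    copies = Counter()
    for snapshot in snapshots:
        for key in snapshot:
            # Loading one copy of a row sends it to each of its rf replicas.
            copies[key] += len(replicas_for(key, num_nodes, rf))
    return copies


def merge_snapshots(snapshots):
    """Merge all snapshots into one, keeping the newest (value, timestamp) per key."""
    merged = {}
    for snapshot in snapshots:
        for key, (value, timestamp) in snapshot.items():
            if key not in merged or timestamp > merged[key][1]:
                merged[key] = (value, timestamp)
    return merged
```

In this model, with 6 nodes and RF = 3, loading every node's snapshot writes 3 x 3 = 9 copies of each row, while loading the single merged snapshot writes 3. The merge also picks the newest cell when the replicas disagreed at snapshot time, which is the inconsistency case the description mentions.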

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
