incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Maxim Kramarenko <>
Subject Cassandra CF sharding
Date Thu, 27 May 2010 20:35:21 GMT

We have mail archive with one large CF for mail body. In our case, it's 
easy to shard data to 5-10 CF by customer id. We like to do this because:

1) We get more manageable instances, because we have many small CF 
instead of one multi-TB CF on each node.

2) Better disk space usage (need to reserve 50% of the largest shard for 
compaction only)

3) Can manage node load not by token only, but also by defining shards 
available per node.

Is my assumptions correct ? Any negative side effects ?

View raw message