incubator-cassandra-user mailing list archives

From Gary Shi <gary...@gmail.com>
Subject Why SSTable is sorted by tokens instead of row keys?
Date Fri, 04 Nov 2011 12:49:34 GMT
I want to store time-series event logs in Cassandra, and I need to load
them by key range (the row key is time-based). RandomPartitioner does not
support this kind of key-range query, while OrderPreservingPartitioner
leads to hot spots.
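
To illustrate why key ranges do not map to token ranges here, the sketch
below approximates RandomPartitioner by taking the MD5 hash of each key as
its token. The key format and the function name random_partitioner_token
are made up for the example, and the real partitioner's token arithmetic
differs in detail.

```python
import hashlib


def random_partitioner_token(key: str) -> int:
    """Approximate a RandomPartitioner token as the MD5 hash of the key.

    Assumption: the digest is read as an unsigned integer; this is a
    simplification of what Cassandra actually does.
    """
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big")


# Hypothetical time-based row keys, one per minute.
keys = [f"2011-11-04T12:{minute:02d}" for minute in range(20)]

by_key = sorted(keys)                                   # time order
by_token = sorted(keys, key=random_partitioner_token)   # storage order

for key in by_token:
    print(f"{random_partitioner_token(key):039d}  {key}")
```

Consecutive timestamps end up scattered across the token space, so a
contiguous time range is not a contiguous token range.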

So I wonder why Cassandra sorts the rows in an SSTable by token instead of
by key. If rows in an SSTable were sorted by key, it should be quite easy
to return rows by key range; the token would be used only to determine
which node holds the data. For key range requests, Cassandra could ask
every node for that range of rows, merge the results, and return them to
the caller.
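
A minimal sketch of the scheme I have in mind, not of how Cassandra works
today: keys are placed on nodes by hashed token, each node keeps its rows
sorted by key, and a range request is sent to every node and merged. The
Node class, node_for, range_query and the modulo placement are all
hypothetical simplifications (a real cluster assigns token ranges on a
ring).

```python
import bisect
import hashlib
import heapq

NUM_NODES = 4  # hypothetical cluster size


def node_for(key: str) -> int:
    """Pick a node from the key's hashed token (simplified placement)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % NUM_NODES


class Node:
    """A node whose local storage is sorted by row key, not by token."""

    def __init__(self) -> None:
        self.keys: list[str] = []

    def insert(self, key: str) -> None:
        bisect.insort(self.keys, key)  # keep local keys in sorted order

    def range_slice(self, start: str, end: str) -> list[str]:
        """Return local keys in the inclusive range [start, end]."""
        lo = bisect.bisect_left(self.keys, start)
        hi = bisect.bisect_right(self.keys, end)
        return self.keys[lo:hi]


def range_query(nodes: list[Node], start: str, end: str) -> list[str]:
    """Ask every node for the range and merge the sorted partial results."""
    partials = [node.range_slice(start, end) for node in nodes]
    return list(heapq.merge(*partials))


nodes = [Node() for _ in range(NUM_NODES)]
for minute in range(60):
    key = f"2011-11-04T12:{minute:02d}"
    nodes[node_for(key)].insert(key)

result = range_query(nodes, "2011-11-04T12:10", "2011-11-04T12:19")
print(result)
```

Writes are spread by the hash, so there is no hot spot, but every range
request has to contact every node.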

-- 
regards,
Gary Shi
