cassandra-user mailing list archives

From Mark Robson <mar...@gmail.com>
Subject Re: Cassandra to store logs as a list
Date Wed, 20 Jan 2010 21:20:28 GMT
I think you really want to use the OrderPreservingPartitioner (OPP) with
time-based keys.

It depends exactly how you're querying it. All querying use-cases need to be
taken into account when deciding how to structure your data.

If you use a time-based key with OPP, the data typically becomes very
unbalanced, because the balancing algorithm (such as it is) depends on the
keys continuing to have a similar distribution to the one they had when the
nodes were bootstrapped. New time-based keys always sort after the existing
ones, so that assumption fails.
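
To illustrate the imbalance, here is a rough sketch in Python. It is a
simplified model, not Cassandra code: the node tokens, the date-string keys
and the owner() helper are all made up for the example, and I am assuming
each node owns the keys up to its token, with keys past the last token
wrapping around to the first node.

```python
import bisect
from collections import Counter

def owner(tokens, key):
    """Return the index of the node that owns `key` in this toy model.

    `tokens` is a sorted list of node tokens. Node i owns keys that are
    greater than tokens[i-1] and less than or equal to tokens[i]; keys
    greater than every token wrap around to node 0.
    """
    i = bisect.bisect_left(tokens, key)
    return i % len(tokens)

# Hypothetical tokens, chosen when the data consisted of January keys.
tokens = ["2010-01-08", "2010-01-16", "2010-01-24", "2010-01-31"]

# A month later, every new key sorts after every token.
new_keys = ["2010-02-%02d" % day for day in range(1, 29)]

counts = Counter(owner(tokens, k) for k in new_keys)
print(counts)  # all 28 new keys are owned by a single node
```

The January tokens split the January data evenly, but none of the February
writes are spread across the other three nodes.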

One solution would be to prefix the key with some other field that you
might want to query by, such as account id, customer id or site id, if you
have enough of these to spread the data out evenly (write it in hex and
zero-pad it, of course, so that the keys sort correctly).
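
A minimal sketch of such a key in Python, in case it helps. The function
name, the ":" separator, the field widths and the use of a millisecond
timestamp are my own assumptions for illustration; pick whatever fits your
data.

```python
def make_log_key(account_id, timestamp_ms, id_width=8, ts_width=16):
    """Build a row key of the form <account id>:<timestamp>.

    Both parts are written as zero-padded, fixed-width hex so that plain
    string ordering matches numeric ordering of (account_id, timestamp_ms).
    """
    if account_id < 0 or timestamp_ms < 0:
        raise ValueError("ids and timestamps must be non-negative")
    prefix = format(account_id, "x").zfill(id_width)
    suffix = format(timestamp_ms, "x").zfill(ts_width)
    # A value wider than its field would break the sort order, so reject it.
    if len(prefix) > id_width or len(suffix) > ts_width:
        raise ValueError("value does not fit in the configured width")
    return prefix + ":" + suffix

print(make_log_key(255, 1))  # 000000ff:0000000000000001
```

With keys like this, all entries for one account sit together in time order,
so a range scan over one prefix reads that account's log, while different
accounts land in different parts of the key space.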

Mark
