cassandra-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonathan Ellis <>
Subject Re: Order preserving partitioning strategy
Date Tue, 24 Aug 2010 17:30:18 GMT
What are some good papers to read for background?

On Tue, Aug 24, 2010 at 12:26 PM, J. Andrew Rogers
<> wrote:
> On Mon, Aug 23, 2010 at 8:36 PM, Hien. To Trong <> wrote:
>> OrderPreservingPartitioner is efficient range queries but can cause
>> unevently distributed data. Does anyone has an idea of a
>> HybridPartitioner which takes advantages of both RandomPartitioner
>> and OPP, or at least a partitioner trade off between them.
> What you are looking for is skew adaptive partitioning i.e. like a
> B+Tree except distributable.
> A couple different methods for doing something like this exist, but
> you rarely see them and they have their own (different) tradeoffs. To
> the best of my knowledge, implementation requires a fairly deep
> architectural commitment; it is more involved than simply defining a
> partitioning function and the "adaptive" aspect must be distribution
> friendly. It is an active area of research in the literature with no
> obvious and simple solutions that can be lashed onto a database engine
> "as is".
> --
> J. Andrew Rogers

Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support

View raw message