incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Coli <rc...@eventbrite.com>
Subject Re: Secondary Indexes On Partitioned Time Series Data Question
Date Fri, 02 Aug 2013 00:32:08 GMT
On Thu, Aug 1, 2013 at 2:34 PM, Shahab Yunus <shahab.yunus@gmail.com> wrote:

> Can you shed some more light (or point towards some other resource) that
> why you think built-in Secondary Indexes should not be used easily or
> without much consideration? Thanks.
>

1) Secondary indexes are more or less modeled like a manual pseudo
Secondary Index CF would be.
2) Except they are more opaque than doing it yourself. For example you
cannot see information on them in nodetool cfstats.
3) And there have been a steady trickle of bugs which relate to their
implementation, in many cases resulting in them not returning the data they
should. [1]
4) These bugs would not apply to a manual pseudo Secondary Index CF.
5) And the only benefits you get are the marginal convenience of querying
the secondary index instead of a second CF, and atomic synchronized update.
6) Which most people do not actually need.

tl;dr : "unless you need the atomic update property, just use a manual
pseudo secondary index CF"

=Rob

[1] https://issues.apache.org/jira/browse/CASSANDRA-4785 ,
https://issues.apache.org/jira/browse/CASSANDRA-5540 ,
https://issues.apache.org/jira/browse/CASSANDRA-2897 , etc.

Mime
View raw message