cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nate McCall (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-5970) FilteredRangeSlice command for regex searches against column names on known sets of keys
Date Tue, 03 Sep 2013 21:03:52 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-5970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13757066#comment-13757066
] 

Nate McCall commented on CASSANDRA-5970:
----------------------------------------

bq. Filtering as in "Here's a list of keys, run the regex against their data" 

Yes. 

bq. or is there some kind of partial-match filter involved there as well?

No partial matching on the partition keys, but at least: equality, LTE and GTE.

I'll port over what we have to 'trunk' as-is in a patch file so we are for sure on the same
page about functionality - or at least we can start here in discussions. But again, I think
folding this into existing plumbing where practical should be part of the goal. 
                
> FilteredRangeSlice command for regex searches against column names on known sets of keys
> ----------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-5970
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5970
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Core
>            Reporter: Nate McCall
>
> This is the ability to apply a regex against columns when the set of keys is known. In
filtering the keys, we would like to allow for the following clauses: E, GTE, LTE, NE, inclusive
list, exclusive list.
> The end goal is to provide for efficient searching in the case where you have some knowledge
of the keys. A specific use case would be, say, searching user agent strings in the given
set of date buckets in the classic time-series web log use case. This is a "sweet spot" for
Cassandra and providing a more direct method of access for such will help a lot of users.
> Additionally, this will provide some level of feature parity with RDBMS crowd who've
had this feature for some time.
> Internally, this will include the introduction of a new Verb, SSTableScanner extension
and an ExtendedFilter implementation which applies the regex as well as a new method on StorageProxy.
> This issue does not cover exposing this new query method to thrift and CQL, but obviously
that will be required for this to be of any practical use. Those should be covered by separate
issues.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message