hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Baranau (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-6618) Implement FuzzyRowFilter with ranges support
Date Mon, 20 Aug 2012 19:45:37 GMT
Alex Baranau created HBASE-6618:

             Summary: Implement FuzzyRowFilter with ranges support
                 Key: HBASE-6618
                 URL: https://issues.apache.org/jira/browse/HBASE-6618
             Project: HBase
          Issue Type: New Feature
          Components: filters
            Reporter: Alex Baranau
            Priority: Minor

Apart from current ability to specify fuzzy row filter e.g. for <userId_actionId> format
as ????_0004 (where 0004 - actionId) it would be great to also have ability to specify the
"fuzzy range" , e.g. ????_0004, ..., ????_0099.

See initial discussion here: http://search-hadoop.com/m/WVLJdX0Z65

Note: currently it is possible to provide multiple fuzzy row rules to existing FuzzyRowFilter,
but in case when the range is big (contains thousands of values) it is not efficient.

Filter should perform efficient fast-forwarding during the scan (this is what distinguishes
it from regex row filter).

While such functionality may seem like a proper fit for custom filter (i.e. not including
into standard filter set) it looks like the filter may be very re-useable. We may judge based
on the implementation that will hopefully be added.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message