hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Adela Maznikar (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (HBASE-5032) Add other DELETE type information into the delete bloom filter to optimize the time range query
Date Wed, 07 Aug 2013 18:12:52 GMT

     [ https://issues.apache.org/jira/browse/HBASE-5032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Adela Maznikar resolved HBASE-5032.
-----------------------------------

       Resolution: Fixed
    Fix Version/s: 0.89-fb
    
> Add other DELETE type information into the delete bloom filter to optimize the time range
query
> -----------------------------------------------------------------------------------------------
>
>                 Key: HBASE-5032
>                 URL: https://issues.apache.org/jira/browse/HBASE-5032
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Liyin Tang
>            Assignee: Adela Maznikar
>             Fix For: 0.89-fb
>
>
> To speed up time range scans we need to seek to the maximum timestamp of the requested
range,instead of going to the first KV of the (row, column) pair and iterating from there.
If we don't know the (row, column), e.g. if it is not specified in the query, we need to go
to end of the current row/column pair first, get a KV from there, and do another seek to (row',
column', timerange_max) from there. We can only skip over to the timerange_max timestamp when
we know that there are no DeleteColumn records at the top of that row/column with a higher
timestamp. We can utilize another Bloom filter keyed on (row, column) to quickly find that
out. (From HBASE-4962)
> So the motivation is to save seek ops for scanning time-range queries if we know there
is no delete for this row/column. 
> From the implementation perspective, we have already had a delete family bloom filter
which contains all the delete family key values. So we can reuse the same bloom filter for
all other kinds of delete information such as delete columns or delete. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message