hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-9679) Binary search in HFile block
Date Thu, 27 Mar 2014 02:15:18 GMT

    [ https://issues.apache.org/jira/browse/HBASE-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948775#comment-13948775

Lars Hofhansl commented on HBASE-9679:

I was just thinking about this. Forgot we have this issue.
We could scan through an HFileBlock as we bring it into the cache and build an index with
the offsets of each KV. Then we can binary search based on those offsets.

> Binary search in HFile block
> ----------------------------
>                 Key: HBASE-9679
>                 URL: https://issues.apache.org/jira/browse/HBASE-9679
>             Project: HBase
>          Issue Type: Improvement
>          Components: HFile
>    Affects Versions: 0.95.2, 0.94.12
>            Reporter: Liang Xie
>            Assignee: Liang Xie
>            Priority: Minor
> It's not a top priority issue, seems to me.
> Right now hbase do a linear scan to search a key within a hfile block on interst, in
special case, e.g. 100% read scenario or high read/write ratio scanario, it's useful to do
a binary search improvement to reduce the CPU cost and response time,  i think the biggest
benefit should be the cpu:)

This message was sent by Atlassian JIRA

View raw message