hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars George (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1481) Add fast row key only scanning
Date Thu, 02 Jul 2009 18:01:47 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12726588#action_12726588

Lars George commented on HBASE-1481:

This was filed back then after a discussion on IRC between - I think - you and me and maybe
Stack. For me this was for a faster row counting. Especially in the shell, which is quite
slow because it is a sequential scan. 

For the faster MR variant we have the RowCounter driver class in the hbase.jar, which now
uses as many mappers as the table has regions. The legacy "mapred" one seemed to have default
to the 1 mapper setting from the hadoop configs.

With that in place I am not sure if we need this issue still being open. 

> Add fast row key only scanning
> ------------------------------
>                 Key: HBASE-1481
>                 URL: https://issues.apache.org/jira/browse/HBASE-1481
>             Project: Hadoop HBase
>          Issue Type: Improvement
>    Affects Versions: 0.19.3
>            Reporter: Lars George
>            Priority: Minor
>             Fix For: 0.20.1, 0.21.0
> Instead of requiring a user to set up a scanner with any column and scan the table to
gather all row keys while ignoring the column value we should have a fast and lightweight
scanner that for example takes a "null" for the column list and then simply returns only the
matching keys of all non-empty or deleted rows. Filters should still be applicable.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message