hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-9086) Add some options to improve count performance
Date Sun, 04 Aug 2013 01:37:49 GMT

    [ https://issues.apache.org/jira/browse/HBASE-9086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13728721#comment-13728721

Lars Hofhansl commented on HBASE-9086:

bq. "FirstKeyOnly" should have been named "FirstRowOnly" and "FirstKeyOnly"

Well... It does not return a row, but the first KeyValue on the row. A FilterList of FirstKeyOnlyFilter
and KeyOnlyFilter might work, just need to try to make sure. Probably just needs a unit test
to make sure.
> Add some options to improve count performance
> ---------------------------------------------
>                 Key: HBASE-9086
>                 URL: https://issues.apache.org/jira/browse/HBASE-9086
>             Project: HBase
>          Issue Type: Wish
>          Components: shell
>    Affects Versions: 0.94.2
>            Reporter: Cheney Sun
>         Attachments: HBase-9086.patch, HBase-9086_v0.2.patch
> The current count command in HBase shell is quite slow if the row size is very big (100+kB
each). It would be helpful to provide some option to specify the column to count, which could
give user a chance to reduce the data volume to scan. 
> IMHO, only count the row key would be the ideal solution. Not sure how difficult to implement

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message