hbase-issues mailing list archives

From "Mikhail Antonov (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-15773) CellCounter improvements
Date Fri, 06 May 2016 02:14:12 GMT

    [ https://issues.apache.org/jira/browse/HBASE-15773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15273531#comment-15273531 ]

Mikhail Antonov commented on HBASE-15773:

Good catch, skimmed over the patch, looks good [~ghelmling]  +1 pending QA run.

> CellCounter improvements
> ------------------------
>                 Key: HBASE-15773
>                 URL: https://issues.apache.org/jira/browse/HBASE-15773
>             Project: HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 1.2.0, 1.3.0
>            Reporter: Gary Helmling
>            Assignee: Gary Helmling
>             Fix For: 1.3.0
>         Attachments: HBASE-15773.001.patch
> Looking at the CellCounter map reduce, it seems like it can be improved in a few areas:
> * it does not currently support setting scan batching.  This is important when we're
> fetching all versions for columns.  Actually, it would be nice to support all of the scan
> configuration currently provided in TableInputFormat.
> * generating job counters containing row keys and column qualifiers is guaranteed to
> blow up on anything but the smallest table.  This is not usable and doesn't make any sense
> when the same counts are in the job output.  The row and qualifier specific counters should
> be dropped.
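
The two points in the quoted description can be illustrated with a small standalone sketch. It does not use the HBase API. The function names, the counter naming scheme, and the table dimensions are all hypothetical and chosen only for illustration. It shows (a) how batching caps the number of cells handed back per result when a row carries many versions, and (b) how counters keyed by row and qualifier grow with table size.

```python
def batch_row(cells, batch_size):
    """Split one row's cells into chunks of at most batch_size cells.

    This mimics the idea behind scan batching: a wide row is returned
    as several partial results instead of one large one.
    """
    return [cells[i:i + batch_size] for i in range(0, len(cells), batch_size)]


def count_counter_names(rows):
    """Count the distinct counter names that per-row and per-qualifier
    counters would create. The naming scheme here is an assumption."""
    names = set()
    for row_key, cells in rows.items():
        names.add(f"row:{row_key}")
        for qualifier, _version in cells:
            names.add(f"qualifier:{row_key}:{qualifier}")
    return len(names)


# Hypothetical table: 1000 rows, 5 qualifiers per row, 3 versions per qualifier.
rows = {
    f"r{i}": [(f"q{j}", v) for j in range(5) for v in range(3)]
    for i in range(1000)
}

chunks = batch_row(rows["r0"], batch_size=4)   # 15 cells split into 4 chunks
num_counters = count_counter_names(rows)       # one per row plus one per (row, qualifier)
```

In this sketch the number of counter names scales with the number of rows, while a fixed set of aggregate counters would stay constant. That is the growth the description says makes the row and qualifier specific counters unusable on larger tables.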

This message was sent by Atlassian JIRA
