hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-15773) CellCounter improvements
Date Fri, 06 May 2016 23:09:12 GMT

    [ https://issues.apache.org/jira/browse/HBASE-15773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15274890#comment-15274890

Hudson commented on HBASE-15773:

SUCCESS: Integrated in HBase-1.3 #689 (See [https://builds.apache.org/job/HBase-1.3/689/])
HBASE-15773 Improvements to CellCounter job (garyh: rev 80574615085c9ea1a722546b86e22525e3232bd5)
* hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/CellCounter.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/TableInputFormat.java

> CellCounter improvements
> ------------------------
>                 Key: HBASE-15773
>                 URL: https://issues.apache.org/jira/browse/HBASE-15773
>             Project: HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 1.2.0, 1.3.0
>            Reporter: Gary Helmling
>            Assignee: Gary Helmling
>             Fix For: 1.3.0
>         Attachments: HBASE-15773.001.patch, HBASE-15773.002.patch
> Looking at the CellCounter map reduce, it seems like it can be improved in a few areas:
> * it does not currently support setting scan batching.  This is important when we're
fetching all versions for columns.  Actually, it would be nice to support all of the scan
configuration currently provided in TableInputFormat.
> * generating job counters containing row keys and column qualifiers is guaranteed to
blow up on anything but the smallest table.  This is not usable and doesn't make any sense
when the same counts are in the job output.  The row and qualifier specific counters should
be dropped.

This message was sent by Atlassian JIRA

View raw message