hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gary Helmling (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-15773) CellCounter improvements
Date Thu, 05 May 2016 17:51:12 GMT
Gary Helmling created HBASE-15773:
-------------------------------------

             Summary: CellCounter improvements
                 Key: HBASE-15773
                 URL: https://issues.apache.org/jira/browse/HBASE-15773
             Project: HBase
          Issue Type: Improvement
          Components: mapreduce
            Reporter: Gary Helmling


Looking at the CellCounter map reduce, it seems like it can be improved in a few areas:

* it does not currently support setting scan batching.  This is important when we're fetching
all versions for columns.  Actually, it would be nice to support all of the scan configuration
currently provided in TableInputFormat.
* generating job counters containing row keys and column qualifiers is guaranteed to blow
up on anything but the smallest table.  This is not usable and doesn't make any sense when
the same counts are in the job output.  The row and qualifier specific counters should be
dropped.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message