hbase-dev mailing list archives

From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] Created: (HBASE-1811) Snapshot HFile and region statistics at compaction time and make info available to clients
Date Wed, 02 Sep 2009 23:19:32 GMT
Snapshot HFile and region statistics at compaction time and make info available to clients
------------------------------------------------------------------------------------------

                 Key: HBASE-1811
                 URL: https://issues.apache.org/jira/browse/HBASE-1811
             Project: Hadoop HBase
          Issue Type: Improvement
            Reporter: Andrew Purtell
            Priority: Minor


Consider snapshotting HFile and region statistics at both major and minor compaction time and
making the information available to clients:

* Key statistics
 ** cardinality
 ** length avg/min/max/stdev
 ** information content measure (entropy, etc.)
 ** histogram
etc.

* Value statistics
 ** length avg/min/max/stdev
 ** information content measure (entropy, etc.)
 ** histogram
etc.

* Region statistics
 ** density estimation
 ** KV count
 ** total storage size (on disk)
 ** total storage size (uncompressed)
etc. 
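
As a rough illustration of the key and value statistics listed above, here is a minimal sketch of how length avg/min/max/stdev and a byte-level entropy measure could be accumulated in a single pass, for example while a compaction streams over KeyValues. Everything in it is an assumption made for illustration: the class CompactionStatsSketch, its LengthStats accumulator, and the byteEntropy and countBytes helpers are hypothetical names, not existing HBase APIs, and the issue does not specify which entropy measure or variance method to use. The sketch uses Welford's online algorithm for the standard deviation and Shannon entropy over byte values.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for illustration only; not part of HBase. */
final class CompactionStatsSketch {

    /** Running length statistics, updated one sample at a time (Welford's algorithm). */
    static final class LengthStats {
        private long count;
        private int min = Integer.MAX_VALUE;
        private int max = Integer.MIN_VALUE;
        private double mean;
        private double m2; // running sum of squared deviations from the mean

        void add(int length) {
            count++;
            min = Math.min(min, length);
            max = Math.max(max, length);
            double delta = length - mean;
            mean += delta / count;
            m2 += delta * (length - mean);
        }

        long count() { return count; }
        int min() { return min; }
        int max() { return max; }
        double mean() { return mean; }

        /** Population standard deviation; 0 when no samples have been added. */
        double stdev() { return count > 0 ? Math.sqrt(m2 / count) : 0.0; }
    }

    /** Counts occurrences of each byte value (0..255) across all items. */
    static long[] countBytes(List<byte[]> items) {
        long[] counts = new long[256];
        for (byte[] item : items) {
            for (byte b : item) {
                counts[b & 0xFF]++;
            }
        }
        return counts;
    }

    /** Shannon entropy, in bits per byte, of the given byte-value counts. */
    static double byteEntropy(long[] byteCounts) {
        long total = 0;
        for (long c : byteCounts) {
            total += c;
        }
        if (total == 0) {
            return 0.0;
        }
        double entropy = 0.0;
        for (long c : byteCounts) {
            if (c == 0) {
                continue;
            }
            double p = (double) c / total;
            entropy -= p * (Math.log(p) / Math.log(2));
        }
        return entropy;
    }

    public static void main(String[] args) {
        // Made-up sample keys; a real implementation would see keys during compaction.
        List<byte[]> keys = Arrays.asList(
            "row-0001".getBytes(StandardCharsets.UTF_8),
            "row-0002".getBytes(StandardCharsets.UTF_8),
            "row-00000003".getBytes(StandardCharsets.UTF_8));

        LengthStats stats = new LengthStats();
        for (byte[] key : keys) {
            stats.add(key.length);
        }
        System.out.printf("count=%d min=%d max=%d avg=%.2f stdev=%.2f entropy=%.3f bits/byte%n",
            stats.count(), stats.min(), stats.max(), stats.mean(), stats.stdev(),
            byteEntropy(countBytes(keys)));
    }
}
```

Both accumulators need only constant memory per store file, which is why a single-pass form is shown; cardinality and histograms would need additional, likely approximate, structures that are not sketched here.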


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

