hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Esteban Gutierrez (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-17756) We should have better introspection of HFiles
Date Wed, 08 Mar 2017 00:15:38 GMT
Esteban Gutierrez created HBASE-17756:

             Summary: We should have better introspection of HFiles
                 Key: HBASE-17756
                 URL: https://issues.apache.org/jira/browse/HBASE-17756
             Project: HBase
          Issue Type: Brainstorming
          Components: HFile
            Reporter: Esteban Gutierrez

[~saint.ack@gmail.com] was suggesting to use DataSketches (https://datasketches.github.io)
in order to write additional statistics to the HFiles. This could be used to improve our split
decisions, troubleshooting or potentially do other interesting analysis without having to
perform full table scans. The statistics could be stored as part of the HFile but we could
initially improve the visibility of the data by adding some statistics to HFilePrettyPrinter.

This message was sent by Atlassian JIRA

View raw message