hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "dhruba borthakur (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-5313) Restructure hfiles layout for better compression
Date Wed, 01 Feb 2012 06:40:58 GMT
Restructure hfiles layout for better compression

                 Key: HBASE-5313
                 URL: https://issues.apache.org/jira/browse/HBASE-5313
             Project: HBase
          Issue Type: Improvement
          Components: io
            Reporter: dhruba borthakur
            Assignee: dhruba borthakur

A HFile block contain a stream of key-values. Can we can organize these kvs on the disk in
a better way so that we get much greater compression ratios?

One option (thanks Prakash) is to store all the keys in the beginning of the block (let's
call this the key-section) and then store all their corresponding values towards the end of
the block. This will allow us to not-even decompress the values when we are scanning and skipping
over rows in the block.

Any other ideas? 

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message