hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jean-Daniel Cryans (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-5313) Restructure hfiles layout for better compression
Date Mon, 15 Jul 2013 19:54:50 GMT

    [ https://issues.apache.org/jira/browse/HBASE-5313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13708886#comment-13708886
] 

Jean-Daniel Cryans commented on HBASE-5313:
-------------------------------------------

[~he yongqiang], [~dhruba], [~mikhail]

Guys, I need your help to understand what's going on with this jira. HBASE-5521 has been committed
more than a year ago and nothing moved after that. Moreover, the code breaks encoding by making
it not thread safe. See HBASE-8732.

This makes me think that the code in 5521 was not seriously tested (maybe waiting on this
jira to tie all the loose ends?) and since we are trying to release 0.96.0 soonish, I'm currently
in favor of reverting it.
                
> Restructure hfiles layout for better compression
> ------------------------------------------------
>
>                 Key: HBASE-5313
>                 URL: https://issues.apache.org/jira/browse/HBASE-5313
>             Project: HBase
>          Issue Type: Improvement
>          Components: io
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>
> A HFile block contain a stream of key-values. Can we can organize these kvs on the disk
in a better way so that we get much greater compression ratios?
> One option (thanks Prakash) is to store all the keys in the beginning of the block (let's
call this the key-section) and then store all their corresponding values towards the end of
the block. This will allow us to not-even decompress the values when we are scanning and skipping
over rows in the block.
> Any other ideas? 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message