hbase-dev mailing list archives

From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (HBASE-68) [hbase] HStoreFiles needlessly store the column family name in every entry
Date Sun, 08 Jun 2014 21:51:01 GMT

     [ https://issues.apache.org/jira/browse/HBASE-68?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Purtell resolved HBASE-68.

    Resolution: Not a Problem

Use block encoding

> [hbase] HStoreFiles needlessly store the column family name in every entry
> --------------------------------------------------------------------------
>                 Key: HBASE-68
>                 URL: https://issues.apache.org/jira/browse/HBASE-68
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver
>            Reporter: Bryan Duxbury
>            Priority: Minor
> Today, HStoreFiles keep the entire serialized HStoreKey objects around for every cell
> in the HStore. Since HStores are 1-1 with column families, this is really unnecessary - you
> can always surmise the column family by looking at the HStore it belongs to. (This information
> would ostensibly come from the file name or a header section.) This means that we could remove
> the column family part of the HStoreKeys we put into the HStoreFile, reducing the size of
> data stored. This would be a space-saving benefit, removing redundant data, and could be a
> speed benefit, as you have to scan over less data in memory and transfer less data over the
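
A rough sketch of the redundancy described in the quoted issue, for illustration only. The key layout (row, family, and qualifier concatenated as UTF-8 bytes) and all class and method names here are hypothetical; this is not HBase's actual HStoreKey serialization, and it does not show how block encoding works internally. It only compares repeating the family in every key against recording it once per file.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

public class FamilyOverheadSketch {

    // Byte length of a string in UTF-8.
    static int bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // Hypothetical layout: every key carries row + family + qualifier.
    // Each cell is a two-element array: {row, qualifier}.
    static int sizeWithFamilyInEveryKey(List<String[]> cells, String family) {
        int total = 0;
        for (String[] cell : cells) {
            total += bytes(cell[0]) + bytes(family) + bytes(cell[1]);
        }
        return total;
    }

    // Hypothetical layout: the family is written once (e.g. in a header),
    // and each key carries only row + qualifier.
    static int sizeWithFamilyInHeader(List<String[]> cells, String family) {
        int total = bytes(family);
        for (String[] cell : cells) {
            total += bytes(cell[0]) + bytes(cell[1]);
        }
        return total;
    }

    public static void main(String[] args) {
        List<String[]> cells = List.of(
            new String[] {"row1", "name"},
            new String[] {"row1", "email"},
            new String[] {"row2", "name"});
        String family = "info";

        int repeated = sizeWithFamilyInEveryKey(cells, family);
        int once = sizeWithFamilyInHeader(cells, family);
        System.out.println("family in every key: " + repeated + " bytes");
        System.out.println("family stored once:  " + once + " bytes");
        System.out.println("saved:               " + (repeated - once) + " bytes");
    }
}
```

In this toy layout the saving is the family's byte length times one less than the number of cells, so it grows with the number of cells in the store.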

