hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Reopened: (HBASE-68) [hbase] HStoreFiles needlessly store the column family name in every entry
Date Wed, 08 Jul 2009 18:05:14 GMT

     [ https://issues.apache.org/jira/browse/HBASE-68?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

stack reopened HBASE-68:

> [hbase] HStoreFiles needlessly store the column family name in every entry
> --------------------------------------------------------------------------
>                 Key: HBASE-68
>                 URL: https://issues.apache.org/jira/browse/HBASE-68
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: regionserver
>            Reporter: Bryan Duxbury
>            Priority: Minor
>             Fix For: 0.20.0
> Today, HStoreFiles keep the entire serialized HStoreKey objects around for every cell
in the HStore. Since HStores are 1-1 with column families, this is really unnecessary - you
can always surmise the column family by looking at the HStore it belongs to. (This information
would ostensibly come from the file name or a header section.) This means that we could remove
the column family part of the HStoreKeys we put into the HStoreFile, reducing the size of
data stored. This would be a space-saving benefit, removing redundant data, and could be a
speed benefit, as you have to scan over less data in memory and transfer less data over the

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message