hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "He Yongqiang (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-5457) add inline index in data block for data which are not clustered together
Date Thu, 23 Feb 2012 18:30:48 GMT

    [ https://issues.apache.org/jira/browse/HBASE-5457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13214915#comment-13214915

He Yongqiang commented on HBASE-5457:

@stack, we haven't thought that in much detail, but we can start the discussion by an example.

Let's say there is one column family, and it only contains one type column whose name is a
combine of 'string and ts'. So the data is sorted by 'string' first. But one query wants the
data to be sorted by ts instead.
> add inline index in data block for data which are not clustered together
> ------------------------------------------------------------------------
>                 Key: HBASE-5457
>                 URL: https://issues.apache.org/jira/browse/HBASE-5457
>             Project: HBase
>          Issue Type: New Feature
>            Reporter: He Yongqiang
> As we are go through our data schema, and we found we have one large column family which
is just duplicating data from another column family and is just a re-org of the data to cluster
data in a different way than the original column family in order to serve another type of
queries efficiently.
> If we compare this second column family with similar situation in mysql, it is like an
index in mysql. So if we can add inline block index on required columns, the second column
family then is not needed.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message