hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "He Yongqiang (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-352) Make Hive support column based storage
Date Fri, 20 Mar 2009 07:40:50 GMT

    [ https://issues.apache.org/jira/browse/HIVE-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683783#action_12683783

He Yongqiang commented on HIVE-352:

Thanks, Joydeep and Zheng. The advises are really helpful.
I have written a draft document according to suggestions from Zheng and Joydeep.
Here is the link: http://docs.google.com/Doc?id=dc9jpfdr_3ft7w3hc4

I agree with you guys, we can start from B2, and then B1. And finally find out should we need
to add the VFile in.
BTW, yestoday i also took a look on MapFile, which i found VFile has a same with MapFile in
that VFlie sometimes also need an index file. The main difference is that VFile does not need
a key part and sometimes even the value's length part. Because a VFile stores one column,
each column has a type, and if the data type of that column is fix lengthed, it only needs
to store the raw value bytes.

> Make Hive support column based storage
> --------------------------------------
>                 Key: HIVE-352
>                 URL: https://issues.apache.org/jira/browse/HIVE-352
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: He Yongqiang
> column based storage has been proven a better storage layout for OLAP. 
> Hive does a great job on raw row oriented storage. In this issue, we will enhance hive
to support column based storage. 
> Acctually we have done some work on column based storage on top of hdfs, i think it will
need some review and refactoring to port it to Hive.
> Any thoughts?

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message