hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Ellis (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-705) Let Hive can analyse hbase's tables
Date Sun, 21 Feb 2010 05:12:28 GMT

    [ https://issues.apache.org/jira/browse/HIVE-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836323#action_12836323

Jonathan Ellis commented on HIVE-705:

ISTM that merging the HBase columnfamilies into a single Hive table is the wrong approach
and could lead to poor performance; rather, each HBase CF should be its own Hive table, which
may of course be joined with others as necessary.  (I think using the word "table" for HBase's
"collection of CFs" is unfortunate in the first place since they are different animals; fundamentally,
the basic unit of data access in HBase is the CF.)

I'm interested because Cassandra is also looking at adding Hive support, and we also implement
a ColumnFamily data model.

> Let Hive can analyse hbase's tables
> -----------------------------------
>                 Key: HIVE-705
>                 URL: https://issues.apache.org/jira/browse/HIVE-705
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Samuel Guo
>            Assignee: John Sichi
>         Attachments: hbase-0.19.3-test.jar, hbase-0.19.3.jar, HIVE-705.1.patch, HIVE-705_draft.patch,
HIVE-705_revision806905.patch, HIVE-705_revision883033.patch
> Add a serde over the hbase's tables, so that hive can analyse the data stored in hbase

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message