hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuefu Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-6147) Support avro data stored in HBase columns
Date Thu, 30 Jan 2014 23:24:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-6147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13887238#comment-13887238
] 

Xuefu Zhang commented on HIVE-6147:
-----------------------------------

This looks good, but from the patch, it seems that the solution is only for HBase. I wonder
if we have given thoughts on the idea of generalizing the problem and providing a general
solution. I can see the benefits of separating the storage (such as hbase) and data format
(avro, thrift, protocol buf, parquet, etc).  Then we solve M + N problems rather than M *
N problems. What if the avro data is coming from other storage, such as accumulo, or parquet
data from HBase.

> Support avro data stored in HBase columns
> -----------------------------------------
>
>                 Key: HIVE-6147
>                 URL: https://issues.apache.org/jira/browse/HIVE-6147
>             Project: Hive
>          Issue Type: Bug
>          Components: HBase Handler
>    Affects Versions: 0.12.0
>            Reporter: Swarnim Kulkarni
>            Assignee: Swarnim Kulkarni
>         Attachments: HIVE-6147.1.patch.txt
>
>
> Presently, the HBase Hive integration supports querying only primitive data types in
columns. It would be nice to be able to store and query Avro objects in HBase columns by making
them visible as structs to Hive. This will allow Hive to perform ad hoc analysis of HBase
data which can be deeply structured.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message