hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yeshwanth (Jira)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-6147) Support avro data stored in HBase columns
Date Thu, 18 Jun 2020 15:16:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-6147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17139492#comment-17139492
] 

yeshwanth commented on HIVE-6147:
---------------------------------

We have schema less avro bytes written to hbase cells, with schema id prefixed to the avro
bytes, similar to kafka avro serializer in confluent schema registry. how can i customize
HBaseSerDe to read & query the data from Hive. I have found "hbase.struct.serialization.class"
property but not able to identify which class/method to implement. Wondering anyone had same
use case and solved this already.

> Support avro data stored in HBase columns
> -----------------------------------------
>
>                 Key: HIVE-6147
>                 URL: https://issues.apache.org/jira/browse/HIVE-6147
>             Project: Hive
>          Issue Type: Improvement
>          Components: HBase Handler
>    Affects Versions: 0.12.0, 0.13.0
>            Reporter: Swarnim Kulkarni
>            Assignee: Swarnim Kulkarni
>            Priority: Major
>             Fix For: 0.14.0
>
>         Attachments: HIVE-6147.1.patch.txt, HIVE-6147.2.patch.txt, HIVE-6147.3.patch.txt,
HIVE-6147.3.patch.txt, HIVE-6147.4.patch.txt, HIVE-6147.5.patch.txt, HIVE-6147.6.patch.txt
>
>
> Presently, the HBase Hive integration supports querying only primitive data types in
columns. It would be nice to be able to store and query Avro objects in HBase columns by making
them visible as structs to Hive. This will allow Hive to perform ad hoc analysis of HBase
data which can be deeply structured.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message