hive-issues mailing list archives

From "Nishant Bangarwa (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-19026) Configurable serde for druid kafka indexing
Date Mon, 12 Nov 2018 20:20:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-19026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nishant Bangarwa updated HIVE-19026:
------------------------------------
    Attachment: HIVE-19026.5.patch

> Configurable serde for druid kafka indexing 
> --------------------------------------------
>
>                 Key: HIVE-19026
>                 URL: https://issues.apache.org/jira/browse/HIVE-19026
>             Project: Hive
>          Issue Type: Task
>            Reporter: Nishant Bangarwa
>            Assignee: Nishant Bangarwa
>            Priority: Major
>         Attachments: HIVE-19026.1.patch, HIVE-19026.2.patch, HIVE-19026.3.patch, HIVE-19026.4.patch,
>                      HIVE-19026.5.patch, HIVE-19026.patch
>
>
> https://issues.apache.org/jira/browse/HIVE-18976 introduces support for setting up the Druid
> Kafka indexing service.
> Input serialization should be configurable. For now we can say that only JSON is supported,
> but there should be a mechanism to support other formats. Perhaps we can make use of Hive's
> serde library, e.g. LazySimpleSerDe.
> Also add support for ingesting the timestamp column when its name in the input is not
> `__time`, e.g.:
> CREATE EXTERNAL TABLE druid_kafka_test_avro(__time timestamp , other columns...)
>         STORED BY 'org.apache.hadoop.hive.druid.DruidStorageHandler'
>         TBLPROPERTIES (
>          "druid.timestamp.column" = "myinputColumnTimestamp"
>          .... other ppts 
>          ) 
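
The two requests above (a pluggable input format, and a timestamp column that is not named `__time`) can be illustrated with a small Python sketch. This is only a conceptual illustration under stated assumptions, not the approach taken in the attached patches: the `input.format` and `input.columns` property names and the `PARSERS` registry are hypothetical, and only `druid.timestamp.column` comes from the example in the issue.

```python
import csv
import io
import json
from typing import Callable, Dict

# Hypothetical registry: format name -> parser taking (raw record, table properties).
PARSERS: Dict[str, Callable[[str, dict], dict]] = {}


def register(fmt: str):
    """Decorator that adds a parser to the registry under the given format name."""
    def wrap(fn):
        PARSERS[fmt] = fn
        return fn
    return wrap


@register("json")
def parse_json(raw: str, props: dict) -> dict:
    # JSON is the only format the issue says is supported "for now".
    return json.loads(raw)


@register("csv")
def parse_csv(raw: str, props: dict) -> dict:
    # Column names come from a hypothetical "input.columns" property.
    columns = props["input.columns"].split(",")
    values = next(csv.reader(io.StringIO(raw)))
    return dict(zip(columns, values))


def parse_record(raw: str, props: dict) -> dict:
    """Parse one raw record and expose its timestamp under `__time`."""
    fmt = props.get("input.format", "json")  # hypothetical property name
    if fmt not in PARSERS:
        raise ValueError(f"unsupported input format: {fmt}")
    row = dict(PARSERS[fmt](raw, props))
    # Property name taken from the example in the issue description.
    ts_col = props.get("druid.timestamp.column", "__time")
    if ts_col != "__time":
        # Raises KeyError if the configured column is missing from the record.
        row["__time"] = row.pop(ts_col)
    return row


if __name__ == "__main__":
    props = {"druid.timestamp.column": "myinputColumnTimestamp"}
    raw = '{"myinputColumnTimestamp": "2018-11-12T20:20:00Z", "page": "Hive"}'
    print(parse_record(raw, props))
```

Supporting another format in this sketch means registering one more parser; the timestamp renaming stays in a single place regardless of the input format.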



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
