hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Dimiduk (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2365) SQL support for bulk load into HBase
Date Tue, 04 Feb 2014 01:35:08 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13890259#comment-13890259
] 

Nick Dimiduk commented on HIVE-2365:
------------------------------------

After much fighting with input data and ordering, I have my first little improvement. I've
started a [WIP branch|https://github.com/ndimiduk/hive/tree/2365-sql-support-hbase-bulkloads]
over on Github. I will regularly rewrite it's history, but if you'd like to follow along,
I'll take comments as they come. Once things take shape, I'll squash into a patch and attach
here.

The patch posted supports generating HFiles from a table defined using the HBaseStorageHandler.
The next improvement here is to actually rewrite the plan to introduce a step that invokes
LoadIncrementalHFiles. After that, we can get rid of the need for specifying hfile.family.path,
just detect it from the column family from the mapping attribute and write the HFiles to a
temporary location before loading.

> SQL support for bulk load into HBase
> ------------------------------------
>
>                 Key: HIVE-2365
>                 URL: https://issues.apache.org/jira/browse/HIVE-2365
>             Project: Hive
>          Issue Type: Improvement
>          Components: HBase Handler
>            Reporter: John Sichi
>            Assignee: Nick Dimiduk
>
> Support the "as simple as this" SQL for bulk load from Hive into HBase.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message