hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paul Fosse (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-10924) add support for MERGE statement
Date Sun, 05 Jul 2015 19:06:04 GMT

    [ https://issues.apache.org/jira/browse/HIVE-10924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14614365#comment-14614365
] 

Paul Fosse commented on HIVE-10924:
-----------------------------------

Merge command seams to be needed to do the first use case of the ACID feature. 

"Once an hour, a set of inserts and updates (up to 500k rows) for various dimension tables
(eg. customer, inventory, stores) needs to be processed. The dimension tables have primary
keys and are typically bucketed and sorted on those keys."

Typically we will load the updates to a hive table and just want to merge that table to the
existing dimension.  We are either using the old way of doing this (ingest, reconcile, compact
& purge) or we are writing a Python script to process the updates.  But we can't do 500K
update statements an hour, so it doesn't seem the ACID does us any good for this use case
until we have merge

> add support for MERGE statement
> -------------------------------
>
>                 Key: HIVE-10924
>                 URL: https://issues.apache.org/jira/browse/HIVE-10924
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Planning, Query Processor
>    Affects Versions: 1.2.0
>            Reporter: Eugene Koifman
>            Assignee: Eugene Koifman
>
> add support for 
> MERGE INTO tbl USING src ON … WHEN MATCHED THEN ... WHEN NOT MATCHED THEN ...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message