hive-issues mailing list archives

From "Sankar Hariappan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-17367) IMPORT table doesn't load data from export dumps for insert operation.
Date Tue, 22 Aug 2017 00:01:35 GMT

     [ https://issues.apache.org/jira/browse/HIVE-17367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sankar Hariappan updated HIVE-17367:
------------------------------------
    Summary: IMPORT table doesn't load data from export dumps for insert operation.  (was: IMPORT should overwrite the table if the dump has same state as table.)

> IMPORT table doesn't load data from export dumps for insert operation.
> ----------------------------------------------------------------------
>
>                 Key: HIVE-17367
>                 URL: https://issues.apache.org/jira/browse/HIVE-17367
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2, Import/Export, repl
>    Affects Versions: 3.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>             Fix For: 3.0.0
>
>
> Repl v1 creates a set of EXPORT/IMPORT commands to replicate modified data (as per events) across clusters.
> For instance, suppose an insert generates 2 events:
> ALTER_TABLE (ID: 10)
> INSERT (ID: 11)
> Each event generates a set of EXPORT and IMPORT commands.
> The ALTER_TABLE event generates a metadata-only export/import.
> The INSERT event generates a metadata+data export/import.
> As Hive always dumps the latest copy of the table during export, it sets the latest notification event ID as the table's current state. So, in this example, the metadata import for the ALTER_TABLE event sets the current state of the table to 11.
> Now, when we try to import the data dumped for the INSERT event, it is a no-op because the table's current state (11) is equal to the dump state (11), which in turn means the data never gets replicated to the target cluster.
> So, it is necessary to allow overwrite of a table/partition if its current state equals the dump state.
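
The scenario described above can be sketched as a small simulation. This is only an illustrative model, not Hive's actual import code: the names `Dump`, `TargetTable`, `apply_import`, `replay` and the `allow_equal_state` flag are hypothetical, and the target table is assumed to start at state 0. It shows how a strict "dump is newer than table" check skips the INSERT data, while allowing equal states lets it through.

```python
from dataclasses import dataclass, field


@dataclass
class Dump:
    state: int       # latest notification event ID at export time
    has_data: bool   # False for a metadata-only export
    rows: list = field(default_factory=list)


@dataclass
class TargetTable:
    state: int = 0   # assumed initial state of the target table
    rows: list = field(default_factory=list)


def apply_import(table, dump, allow_equal_state):
    """Apply a dump to the table; return False if it is skipped as a no-op."""
    if allow_equal_state:
        acceptable = dump.state >= table.state
    else:
        acceptable = dump.state > table.state
    if not acceptable:
        return False
    if dump.has_data:
        table.rows = list(dump.rows)  # overwrite with the dumped data
    table.state = dump.state
    return True


def replay(allow_equal_state):
    """Replay the two dumps from the example against a fresh target table."""
    latest_event_id = 11  # both exports see the latest copy of the table
    dumps = [
        # Produced for ALTER_TABLE (ID: 10): metadata only
        Dump(state=latest_event_id, has_data=False),
        # Produced for INSERT (ID: 11): metadata + data
        Dump(state=latest_event_id, has_data=True, rows=["row-1"]),
    ]
    table = TargetTable()
    for dump in dumps:
        apply_import(table, dump, allow_equal_state)
    return table
```

Under these assumptions, `replay(False)` leaves the target table at state 11 with no rows, which is the reported bug, while `replay(True)` leaves it at state 11 with the inserted row.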



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
