hive-issues mailing list archives

From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-17367) IMPORT table doesn't load from data dump if a metadata-only dump was already imported.
Date Fri, 25 Aug 2017 20:43:00 GMT


Hive QA commented on HIVE-17367:

Here are the results of testing the latest attachment:

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 11005 tests executed
*Failed tests:*
org.apache.hadoop.hive.cli.TestAccumuloCliDriver.testCliDriver[accumulo_queries] (batchId=231)
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2] (batchId=100)
org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query14] (batchId=235)
org.apache.hive.jdbc.TestJdbcWithMiniHS2.testHttpRetryOnServerIdleTimeout (batchId=228)

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed

This message is automatically generated.

ATTACHMENT ID: 12883786 - PreCommit-HIVE-Build

> IMPORT table doesn't load from data dump if a metadata-only dump was already imported.
> --------------------------------------------------------------------------------------
>                 Key: HIVE-17367
>                 URL:
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2, Import/Export, repl
>    Affects Versions: 3.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>              Labels: DR, replication
>             Fix For: 3.0.0
>         Attachments: HIVE-17367.01.patch, HIVE-17367.02.patch
> Repl v1 creates a set of EXPORT/IMPORT commands to replicate modified data (as per events) across clusters.
> For instance, let's say an insert generates 2 events, such as
> INSERT (ID: 11)
> Each event generates a set of EXPORT and IMPORT commands.
> The ALTER_TABLE event generates a metadata-only export/import.
> The INSERT event generates a metadata+data export/import.
> As Hive always dumps the latest copy of the table during export, it sets the latest notification event ID as the table's current state. So, in this example, the import of metadata by the ALTER_TABLE event sets the current state of the table to 11.
> Now, when we try to import the data dumped by the INSERT event, it is a no-op because the table's current state (11) is equal to the dump state (11), which in turn means the data never gets replicated to the target cluster.
> So, it is necessary to allow overwrite of a table/partition if its current state equals the dump state.
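
The state comparison described above can be illustrated with a minimal sketch. This is not Hive's actual code: the class and method names (ReplStateCheck, allowReplaceStrict, allowReplaceInclusive) are hypothetical, and the sketch assumes the replication state is a plain numeric event ID. It only shows why a strictly-greater check skips the data import in the example and how an inclusive check would let it through.

```java
public class ReplStateCheck {

    /**
     * Hypothetical model of the behaviour the issue reports:
     * the import is applied only if the dump is strictly newer
     * than the table's current state.
     */
    static boolean allowReplaceStrict(long currentState, long dumpState) {
        return dumpState > currentState;
    }

    /**
     * Hypothetical model of the proposed behaviour: the import is
     * also applied when the dump state equals the current state.
     */
    static boolean allowReplaceInclusive(long currentState, long dumpState) {
        return dumpState >= currentState;
    }

    public static void main(String[] args) {
        // From the issue: the metadata-only import already set the table state to 11.
        long currentState = 11L;
        // The data dump produced for the INSERT event also carries state 11.
        long dumpState = 11L;

        System.out.println("strict check applies import:    "
                + allowReplaceStrict(currentState, dumpState));
        System.out.println("inclusive check applies import: "
                + allowReplaceInclusive(currentState, dumpState));
    }
}
```

With both states at 11, the strict check rejects the data import and the inclusive check accepts it, which is the change the description asks for. An older dump (for example state 10 against a current state of 11) is still rejected by both checks.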

This message was sent by Atlassian JIRA
