hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17196) CM: ReplCopyTask should retain the original file names even if copied from CM path.
Date Fri, 15 Sep 2017 06:44:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167420#comment-16167420
] 

Daniel Dai commented on HIVE-17196:
-----------------------------------

A second look this is still a problem. However, this apply to bootstrap only. The conflict
could happen during bootstrap when:
1. the source table contains two file of the same content
2. both files are moved to CM at the time of repl load

Will upload a patch with test case.

> CM: ReplCopyTask should retain the original file names even if copied from CM path.
> -----------------------------------------------------------------------------------
>
>                 Key: HIVE-17196
>                 URL: https://issues.apache.org/jira/browse/HIVE-17196
>             Project: Hive
>          Issue Type: Sub-task
>          Components: repl
>    Affects Versions: 2.1.0
>            Reporter: Sankar Hariappan
>            Assignee: Daniel Dai
>             Fix For: 3.0.0
>
>         Attachments: HIVE-17196.1.patch
>
>
> Consider the below scenario,
> 1. Insert into table T1 with value(X).
> 2. Insert into table T1 with value(X).
> 3. Truncate the table T1. 
> – This step backs up 2 files with same content to cmroot which ends up with one file
in cmroot as checksum matches.
> 4. Incremental repl with above 3 operations.
> – In this step, both the insert event files will be read from cmroot where copy of
one leads to overwrite the other one as the file name is same in cm path (checksum as file
name).
> So, this leads to data loss and hence it is necessary to retain the original file names
even if we copy from cm path.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message