hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "anishek (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-16898) Validation of source file after distcp in repl load
Date Fri, 15 Sep 2017 09:06:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-16898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167575#comment-16167575
] 

anishek commented on HIVE-16898:
--------------------------------

[~daijy] can you please provide a pull request for the same.


> Validation of source file after distcp in repl load 
> ----------------------------------------------------
>
>                 Key: HIVE-16898
>                 URL: https://issues.apache.org/jira/browse/HIVE-16898
>             Project: Hive
>          Issue Type: Bug
>          Components: HiveServer2
>    Affects Versions: 3.0.0
>            Reporter: anishek
>            Assignee: Daniel Dai
>             Fix For: 3.0.0
>
>         Attachments: HIVE-16898.1.patch
>
>
> time between deciding the source and destination path for distcp to invoking of distcp
can have a change of the source file, hence distcp might copy the wrong file to destination,
hence we should an additional check on the checksum of the source file path after distcp finishes
to make sure the path didnot change during the copy process. if it has take additional steps
to delete the previous file on destination and copy the new source and repeat the same process
as above till we copy the correct file. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message