hadoop-hdfs-issues mailing list archives

From "Ayush Saxena (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-14621) Distcp can not preserve timestamp with -delete option
Date Fri, 19 Jul 2019 15:02:00 GMT

    [ https://issues.apache.org/jira/browse/HDFS-14621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888953#comment-16888953 ]

Ayush Saxena commented on HDFS-14621:
-------------------------------------

Thanks [~pilchard] for the patch.
v004 LGTM, +1.
If there are no objections, I will push by EOD.

> Distcp can not preserve timestamp with -delete  option
> ------------------------------------------------------
>
>                 Key: HDFS-14621
>                 URL: https://issues.apache.org/jira/browse/HDFS-14621
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: distcp
>    Affects Versions: 2.7.7, 3.1.2
>            Reporter: ludun
>            Priority: Major
>         Attachments: HDFS-14261.001.patch, HDFS-14621.002.patch, HDFS-14621.003.patch, HDFS-14621.004.patch
>
>
> Use distcp with -prbugpcaxt and -delete to copy data between clusters.
> hadoop distcp -Dmapreduce.job.queuename="QueueA" -prbugpcaxt -update -delete hdfs://sourcecluster/user/hive/warehouse/sum.db hdfs://destcluster/user/hive/warehouse/sum.db
> After distcp, we found that timestamps on the destination differ from the source, and that the timestamp of some directories was the time distcp ran.
> Checking the distcp code: CopyCommitter preserves timestamps first and then processes the -delete option, which changes the timestamp of the destination directory. So we should process the -delete option first.
>  
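The ordering problem described above can be reproduced outside Hadoop. The following is a minimal sketch, not the CopyCommitter code: it uses java.nio.file on the local filesystem as a stand-in for HDFS, and the class name CommitOrderDemo, the file name stale.txt, and the SOURCE_TIME value are hypothetical names chosen for illustration. It assumes a filesystem that updates a directory's modification time when a child entry is removed.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.attribute.FileTime;

public class CommitOrderDemo {
    // Hypothetical "source" timestamp that the copy is supposed to preserve.
    static final FileTime SOURCE_TIME = FileTime.fromMillis(1_000_000_000_000L);

    // Creates a temporary "destination" directory holding one stale file,
    // i.e. a file that a -delete pass would remove.
    static Path newDestWithStaleFile() throws IOException {
        Path dir = Files.createTempDirectory("dest");
        Files.createFile(dir.resolve("stale.txt"));
        return dir;
    }

    // Order described in the report: preserve the timestamp, then delete.
    static FileTime preserveThenDelete() throws IOException {
        Path dir = newDestWithStaleFile();
        Files.setLastModifiedTime(dir, SOURCE_TIME);
        // Removing a child entry modifies the directory, so its mtime moves to "now".
        Files.delete(dir.resolve("stale.txt"));
        FileTime result = Files.getLastModifiedTime(dir);
        Files.delete(dir); // clean up the now-empty temp directory
        return result;
    }

    // Proposed order: delete first, then preserve the timestamp.
    static FileTime deleteThenPreserve() throws IOException {
        Path dir = newDestWithStaleFile();
        Files.delete(dir.resolve("stale.txt"));
        Files.setLastModifiedTime(dir, SOURCE_TIME);
        FileTime result = Files.getLastModifiedTime(dir);
        Files.delete(dir); // clean up the now-empty temp directory
        return result;
    }

    public static void main(String[] args) throws IOException {
        System.out.println("preserve then delete: " + preserveThenDelete());
        System.out.println("delete then preserve: " + deleteThenPreserve());
    }
}
```

Under that assumption, the first line should show roughly the time the program ran, while the second should show the preserved 2001 timestamp, which mirrors the reasoning for processing -delete before preserving times.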



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

