hadoop-hdfs-issues mailing list archives

From "Shashikant Banerjee (Jira)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-13660) DistCp job fails when new data is appended in the file while the distCp copy job is running
Date Tue, 10 Sep 2019 09:55:01 GMT

     [ https://issues.apache.org/jira/browse/HDFS-13660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shashikant Banerjee updated HDFS-13660:
---------------------------------------
    Status: Patch Available  (was: Open)

> DistCp job fails when new data is appended in the file while the distCp copy job is running
> -------------------------------------------------------------------------------------------
>
>                 Key: HDFS-13660
>                 URL: https://issues.apache.org/jira/browse/HDFS-13660
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: distcp
>            Reporter: Mukund Thakur
>            Assignee: Mukund Thakur
>            Priority: Critical
>         Attachments: distcp_failure_when_file_append.log
>
>
> Steps to reproduce:
> Suppose a DistCp MR job is copying the file /tmp/web_returns_merged/data-m-002 and,
> while the copy is in progress, we append more data to this file using the command
> hadoop fs -appendToFile xaa /tmp/web_returns_merged/data-m-002
> The job then fails with the exception
>  Mismatch in length of source:hdfs://mycluster0/tmp/web_returns_merged/data-m-002 and target.
> The logs are attached.
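
For readers without a cluster at hand, the race in the steps above can be imitated on a local filesystem. The sketch below is not DistCp code. It assumes the failure comes from comparing the source length recorded before the copy starts with the number of bytes actually copied, which the report does not confirm. The names copy_with_length_check, on_listed and demo are hypothetical, and the error text only mirrors the message quoted in the report.

```python
import os
import tempfile


def copy_with_length_check(src, dst, on_listed=None):
    """Copy src to dst, then verify the byte count against the length
    recorded before the copy started (assumed to mimic the failing check)."""
    listed_len = os.path.getsize(src)  # length captured at "listing" time
    if on_listed is not None:
        on_listed()  # hook used to simulate a concurrent append
    copied = 0
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        while True:
            chunk = fin.read(4096)
            if not chunk:
                break
            fout.write(chunk)
            copied += len(chunk)
    if copied != listed_len:
        raise IOError(f"Mismatch in length of source:{src} and target:{dst}")
    return copied


def demo():
    """Return the error message raised when the source grows after listing."""
    with tempfile.TemporaryDirectory() as d:
        src = os.path.join(d, "data-m-002")
        dst = os.path.join(d, "data-m-002.copy")
        with open(src, "wb") as f:
            f.write(b"a" * 100)

        def append_more():
            # Stand-in for: hadoop fs -appendToFile xaa <file>
            with open(src, "ab") as f:
                f.write(b"b" * 10)

        try:
            copy_with_length_check(src, dst, on_listed=append_more)
        except IOError as e:
            return str(e)
    return None


if __name__ == "__main__":
    print(demo())
```

Without the on_listed hook the copy completes and returns the byte count. With the hook, the copied size exceeds the recorded length and the check raises, which matches the symptom reported here.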



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

