hadoop-mapreduce-dev mailing list archives

From "Jothi Padmanabhan (JIRA)" <j...@apache.org>
Subject [jira] Created: (MAPREDUCE-1231) Distcp is very slow
Date Mon, 23 Nov 2009 04:16:39 GMT
Distcp is very slow

                 Key: MAPREDUCE-1231
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1231
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: distcp
    Affects Versions: 0.20.1
            Reporter: Jothi Padmanabhan
            Assignee: Jothi Padmanabhan
             Fix For: 0.20.2

Currently, distcp performs a checksum check in addition to a file length check to decide
whether a remote file has to be copied. When the number of files is high (thousands), this
checksum check proves fairly costly, leading to a long delay before the copy starts.
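
The decision described above can be sketched as follows. This is only an illustration of the length-then-checksum comparison, not distcp's actual code: `FileInfo`, `ChecksumCounter`, `needs_copy`, and the `compare_checksums` switch are hypothetical names, and CRC32 stands in for whatever checksum the file system really provides.

```python
import zlib
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class FileInfo:
    """Hypothetical stand-in for a file's metadata and content."""
    length: int
    data: bytes


class ChecksumCounter:
    """Computes a CRC32 checksum and counts how often it is asked to.

    In a real cluster each call could be a remote round trip, which is
    the cost the issue describes; the counter only makes it visible.
    """

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, info: FileInfo) -> int:
        self.calls += 1
        return zlib.crc32(info.data)


def needs_copy(
    src: FileInfo,
    dst: Optional[FileInfo],
    checksum: Callable[[FileInfo], int],
    compare_checksums: bool = True,
) -> bool:
    """Return True if src should be copied over dst."""
    if dst is None:
        return True  # nothing at the destination yet
    if src.length != dst.length:
        return True  # cheap check: lengths differ, no checksum needed
    if compare_checksums:
        # expensive check: one checksum per side for every same-length file
        return checksum(src) != checksum(dst)
    return False  # assumption: equal length is treated as "same file"


counter = ChecksumCounter()
same = FileInfo(3, b"abc")
changed = FileInfo(3, b"abd")
print(needs_copy(same, FileInfo(3, b"abc"), counter))  # identical content
print(needs_copy(same, changed, counter))              # same length, different bytes
print(counter.calls)                                   # checksum invocations so far
```

With thousands of same-length files, the checksum branch runs twice per file before any copying begins. Passing `compare_checksums=False` in this sketch skips that work, at the price of missing files whose content changed without a change in length.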

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
