hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tsz Wo (Nicholas), SZE (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3939) DistCp should support an option for deleting non-existing files.
Date Fri, 29 Aug 2008 00:07:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12626798#action_12626798

Tsz Wo (Nicholas), SZE commented on HADOOP-3939:

> Would it make sense to require either -update or -overwrite if -delete is specified?

We should enforce that.

> The fix to FileStatus makes sense, but when is the Path null?

I hit this when creating a FileStatus by the default constructor and then put is in some data
structure (I forgot which data structure).  The current implementation does not need to this
operation.  So I will revert this change.

> DistCp should support an option for deleting non-existing files.
> ----------------------------------------------------------------
>                 Key: HADOOP-3939
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3939
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: tools/distcp
>            Reporter: Tsz Wo (Nicholas), SZE
>            Assignee: Tsz Wo (Nicholas), SZE
>         Attachments: 3939_20080825.patch, 3939_20080825b.patch, 3939_20080826.patch
> One use case of DistCp is to sync two directories.  Currently, DistCp has an -update
option for overwriting dst files if src is different from dst.  However, it is not enough
for sync.  If there are some files in dst but not exist in src, there is no easy way to delete
them.  We should add a new option, say -delete, so that DistCp will delete the non-existing
in dst.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message