hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kai Zheng (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-9638) Improve DistCp Help and documentation
Date Tue, 12 Jan 2016 00:35:39 GMT

    [ https://issues.apache.org/jira/browse/HDFS-9638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15093011#comment-15093011

Kai Zheng commented on HDFS-9638:

Good to have this to improve and update the documentation. 

In the mailing list I had some comments as below.
I read the doc at the following link and regard it as the latest revision that corresponds
with the trunk codebase.
If that’s right, then we may need to complement it with the following important features
because I don’t see they are mentioned in the doc.
1.	–diff option, use snapshot diff report to identify the differences between source and
target to compute the copying list.
2.	–numListstatusThreads option, number of threads to concurrently compute the copying list.
3.	–p t, to preserve timestamps.
As above features are great things for user to use in order to speed up the time consuming
inter or intra cluster sync, not only to add these options in the table of command line options,
but also better to document them well as we did for other functions.

Would be good to check and address the questions here as well. Thanks.

> Improve DistCp Help and documentation
> -------------------------------------
>                 Key: HDFS-9638
>                 URL: https://issues.apache.org/jira/browse/HDFS-9638
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: distcp
>    Affects Versions: 3.0.0
>            Reporter: Wei-Chiu Chuang
>            Assignee: Wei-Chiu Chuang
>            Priority: Minor
>              Labels: supportability
> For example,
> -mapredSslConf <arg>   Configuration for ssl config file, to use with
>                         hftps://
> But this ssl config file should be in the classpath, which is not clearly stated.
> http://hadoop.apache.org/docs/current/hadoop-distcp/DistCp.html
> "When using the hsftp protocol with a source, the security- related properties may be
specified in a config-file and passed to DistCp. <ssl_conf_file> needs to be in the
classpath. "
> It is also not clear from the context if this ssl_conf_file should be at the client issuing
the command. (I think the answer is yes)
> Also, in: http://hadoop.apache.org/docs/current/hadoop-distcp/DistCp.html
> "The following is an example of the contents of the contents of a SSL Configuration file:"
> there's an extra "of the contents of the contents "

This message was sent by Atlassian JIRA

View raw message