hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-15208) DistCp to offer option to save src/dest filesets as alternative to delete()
Date Wed, 14 Feb 2018 18:41:01 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-15208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16364584#comment-16364584
] 

Steve Loughran commented on HADOOP-15208:
-----------------------------------------

Patch 003
* checkstyle (including some existing lines which some indentation changes had highlit as
bad style)
* added an ADL implementation of the DistCP test, as it had none. This needed the POM set
up too.

Tested: AWS (ireland), wasb (ireland), ADL (somewhere?)

> DistCp to offer option to save src/dest filesets as alternative to delete()
> ---------------------------------------------------------------------------
>
>                 Key: HADOOP-15208
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15208
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: tools/distcp
>    Affects Versions: 2.9.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Major
>         Attachments: HADOOP-15208-001.patch, HADOOP-15208-002.patch, HADOOP-15208-002.patch,
HADOOP-15208-003.patch
>
>
> There are opportunities to improve distcp delete performance and scalability with object
stores, but you need to test with production datasets to determine if the optimizations work,
don't run out of memory, etc.
> By adding the option to save the sequence files of source, dest listings, people (myself
included) can experiment with different strategies before trying to commit one which doesn't
scale



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org


Mime
View raw message