hive-dev mailing list archives

From "anishek (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-16900) optimization to give distcp a list of input files to copy to a destination target directory
Date Wed, 14 Jun 2017 09:53:00 GMT
anishek created HIVE-16900:
------------------------------

             Summary: optimization to give distcp a list of input files to copy to a destination target directory
                 Key: HIVE-16900
                 URL: https://issues.apache.org/jira/browse/HIVE-16900
             Project: Hive
          Issue Type: Improvement
          Components: HiveServer2
    Affects Versions: 3.0.0
            Reporter: anishek
            Assignee: anishek
             Fix For: 3.0.0


During repl copy, we currently allow only one copy operation per file, whereas distcp supports
a list of files. During bootstrap load of tables/partitions, it would be more efficient to copy
all files listed in _files in a single distcp job. This would require changes to the _shims_
subproject in Hive to additionally expose APIs that take multiple source files.
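
As a rough illustration of the idea (not the actual Hive or shims code), the sketch below turns a
listing of source files into one distcp argument list of the form "source... target", which is
the multi-source invocation the distcp command line accepts. The class name DistCpArgsSketch,
the method buildArgs, the example URIs, and the assumption that _files holds one source URI per
line are all hypothetical and chosen only for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: not part of Hive or its shims sub project.
public class DistCpArgsSketch {

    /**
     * Builds a single distcp argument list: every non-blank source URI
     * from the listing, followed by the one target directory.
     * Assumption: the listing has one source URI per line.
     */
    public static List<String> buildArgs(List<String> filesListing, String targetDir) {
        List<String> args = new ArrayList<>();
        for (String line : filesListing) {
            String uri = line.trim();
            if (!uri.isEmpty()) {
                args.add(uri);
            }
        }
        if (args.isEmpty()) {
            throw new IllegalArgumentException("no source files listed");
        }
        // distcp expects the target as the last argument.
        args.add(targetDir);
        return args;
    }

    public static void main(String[] unused) {
        // Example URIs are made up for illustration.
        List<String> listing = Arrays.asList(
                "hdfs://nn1:8020/warehouse/t1/000000_0",
                "hdfs://nn1:8020/warehouse/t1/000001_0");
        List<String> args = buildArgs(listing, "hdfs://nn2:8020/warehouse/t1");
        // One job for all files instead of one copy operation per file.
        System.out.println("hadoop distcp " + String.join(" ", args));
    }
}
```

An API in the shims layer that accepts a list of source paths could hand such a list to distcp
in one call; how that API would be shaped is left open here.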




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
