hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sankar Hariappan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-21269) Mandate -update and -delete as DistCp options to sync data files for external tables replication.
Date Thu, 14 Feb 2019 17:08:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sankar Hariappan updated HIVE-21269:
------------------------------------
    Status: Patch Available  (was: Open)

02.patch fixed test failures.

>  Mandate -update and -delete as DistCp options to sync data files for external tables
replication.
> --------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-21269
>                 URL: https://issues.apache.org/jira/browse/HIVE-21269
>             Project: Hive
>          Issue Type: Bug
>          Components: repl
>    Affects Versions: 4.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>            Priority: Major
>              Labels: DR, pull-request-available, replication
>         Attachments: HIVE-21269.01.patch, HIVE-21269.02.patch
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently, external tables replication, copies the data in directory level. So, if target
directory exist, then DistCp should compare and update or skip data files in the directory
instead of creating new directory inside pre-existing target directory.
> This can be achieved using -update.
> Also, -delete option is needed to delete the files missing in source directory but present
in target.
> Hive should mandate these DistCp options even if user passes other options.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message