hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sankar Hariappan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-21269) Mandate -update and -delete as DistCp options to sync data for external tables replication.
Date Thu, 14 Feb 2019 06:57:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sankar Hariappan updated HIVE-21269:
------------------------------------
    Summary:  Mandate -update and -delete as DistCp options to sync data for external tables
replication.  (was:  Mandate -update and -delete as DistCp options to avoid data inconsistency
with external tables replication.)

>  Mandate -update and -delete as DistCp options to sync data for external tables replication.
> --------------------------------------------------------------------------------------------
>
>                 Key: HIVE-21269
>                 URL: https://issues.apache.org/jira/browse/HIVE-21269
>             Project: Hive
>          Issue Type: Bug
>          Components: repl
>    Affects Versions: 4.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>            Priority: Major
>              Labels: DR, replication
>
> Currently, external tables replication, copies the data in directory level. So, if target
directory exist, then DistCp should compare and update or skip data files in the directory
instead of creating new directory inside pre-existing target directory.
> This can be achieved using -update.
> Also, -delete option is needed to delete the files missing in source directory but present
in target.
> Hive should mandate these DistCp options even if user passes other options.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message