hive-dev mailing list archives

From "Feng Peng (JIRA)" <>
Subject [jira] [Commented] (HIVE-1161) Hive Replication
Date Wed, 20 Mar 2013 22:23:15 GMT


Feng Peng commented on HIVE-1161:

Is anyone planning to work on this issue? We plan to implement partition-level replication
between two clusters: given a source cluster and a target cluster, we specify a table, and
the tool syncs all updated partitions from the source cluster to the target cluster,
creating the table on the target cluster if it doesn't already exist.

I saw the comments from Namit (
and am wondering if this is something already done somewhere and if it can be shared.

> Hive Replication
> ----------------
>                 Key: HIVE-1161
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>          Components: Contrib
>            Reporter: Edward Capriolo
>            Assignee: Edward Capriolo
>            Priority: Minor
> Users may want to replicate data between two distinct Hadoop clusters or two Hive warehouses
> on the same cluster.
> Users may want to replicate entire catalogs, or possibly replicate on a table-by-table basis.
> Should this process be batch driven or be a full-time running application? What are some
> practical requirements, and what are the limitations?
> Comments?
