Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B68EC19546 for ; Tue, 29 Mar 2016 14:06:37 +0000 (UTC) Received: (qmail 27610 invoked by uid 500); 29 Mar 2016 14:06:37 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 26822 invoked by uid 500); 29 Mar 2016 14:06:36 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 26088 invoked by uid 99); 29 Mar 2016 14:06:32 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 29 Mar 2016 14:06:32 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id A83092C1F71 for ; Tue, 29 Mar 2016 14:06:25 +0000 (UTC) Date: Tue, 29 Mar 2016 14:06:25 +0000 (UTC) From: "Abhishek Soni (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-13639) SyncTable - rsync for HBase tables MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-13639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15216048#comment-15216048 ] Abhishek Soni commented on HBASE-13639: --------------------------------------- I have tried to use this feature but I couldn't. Its usage detail still feels incomplete. I can easily understand what this feature intends to do and its internal procedure while doing so. But it would be great if we have an elaborate usage information of this feature. For ex., I have a table test1 in cluster1 with cf1:1,cf1:b & cf1,c columns and another table with similar schema in cluster2 with name test2. What I understood from this feature is that I can copy data from test1 table to test2. But I am not sure how it should be done. Any document or simple steps for it would be great help. > SyncTable - rsync for HBase tables > ---------------------------------- > > Key: HBASE-13639 > URL: https://issues.apache.org/jira/browse/HBASE-13639 > Project: HBase > Issue Type: New Feature > Components: mapreduce, Operability, tooling > Reporter: Dave Latham > Assignee: Dave Latham > Labels: tooling > Fix For: 2.0.0, 0.98.14, 1.2.0 > > Attachments: HBASE-13639-0.98-addendum-hadoop-1.patch, HBASE-13639-0.98.patch, HBASE-13639-v1.patch, HBASE-13639-v2.patch, HBASE-13639-v3-0.98.patch, HBASE-13639-v3.patch, HBASE-13639.patch > > > Given HBase tables in remote clusters with similar but not identical data, efficiently update a target table such that the data in question is identical to a source table. Efficiency in this context means using far less network traffic than would be required to ship all the data from one cluster to the other. Takes inspiration from rsync. > Design doc: https://docs.google.com/document/d/1-2c9kJEWNrXf5V4q_wBcoIXfdchN7Pxvxv1IO6PW0-U/ -- This message was sent by Atlassian JIRA (v6.3.4#6332)