Date: Mon, 28 Dec 2015 23:11:49 +0000 (UTC)
From: "Vladimir Rodionov (JIRA)"
To: issues@hbase.apache.org
Subject: [jira] [Updated] (HBASE-14030) HBase Backup/Restore Phase 1

     [ https://issues.apache.org/jira/browse/HBASE-14030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vladimir Rodionov updated HBASE-14030:
--------------------------------------
    Release Note:
This experimental feature allows users to perform backup/restore operations, including incremental ones, on a set of HBase tables.

Key features and Use Cases

A common practice for backup and restore in databases is to first take a full baseline backup, and then periodically take incremental backups that capture the changes since the full baseline backup.
An HBase cluster can store a massive amount of data. Therefore we want to use full backups in combination with incremental backups for HBase as well.

The following is a typical use case scenario for full and incremental backup:
● The user takes a full backup of a table or a set of tables in HBase.
● The user schedules periodic incremental backups to capture the changes since the full backup, or since the last incremental backup.
● The user needs to restore table data to a past point in time.
● The full backup is restored to the table(s) or to different table name(s). Then the incremental backups that are up to the desired point in time are applied on top of the full backup.

We would support the following key features and capabilities.
● Backup to DFS FileSystem across clusters and possibly to other storage media or servers.
● Support single table or a set of tables backup and restore (full and incremental).
● Restore to different table names and to different clusters.
● Support adding and removing tables to and from the backup set without interruption of the incremental backup schedule.
● Support merging incremental backups into longer-period, bigger incremental backups for easy storage and restore.
● Support scheduled backups.
● Unified command line interface for all the above.

To illustrate these key capabilities, the following are two more detailed use case examples.

Use case example 1:
1. The user takes a full backup of a set of tables (e.g. table1 and table2) in HBase.
2. The user takes incremental backups. The incremental backups will only track table1 and table2.
3. The user adds other tables (e.g. table3 and table4) in HBase, and an implicit full backup is executed during the add process.
4. The user continues to take incremental backups. The incremental backup data would cover table1, table2, table3 and table4.
5. The user wants to restore table3 and table4 to a past PIT (point-in-time).
6. The full backup from step 3 is restored onto the HBase cluster. Then the incremental backups after that full backup are applied on top of the full restore until the PIT.

Use case example 2:
1. The user takes a full backup of a set of tables in HBase.
2. The user takes daily incremental backups.
3. The user merges the daily incremental backups into weekly incremental backups.
4. The user combines/rolls up the weekly incremental backups into monthly incremental backups.
5. The user wants to restore the tables to a past PIT.
6. The full backup is restored onto the HBase cluster.
7. The monthly incremental backups before the desired PIT are applied.
8. The closest daily backups up to the PIT are applied.

  was:
This experimental feature allows to perform backup/restore operations, including incremental ones, on a set of HBase tables.

> HBase Backup/Restore Phase 1
> ----------------------------
>
> Key: HBASE-14030
> URL: https://issues.apache.org/jira/browse/HBASE-14030
> Project: HBase
> Issue Type: Umbrella
> Affects Versions: 2.0.0
> Reporter: Vladimir Rodionov
> Assignee: Vladimir Rodionov
> Fix For: 2.0.0
>
> Attachments: HBASE-14030-v0.patch, HBASE-14030-v1.patch, HBASE-14030-v10.patch, HBASE-14030-v11.patch, HBASE-14030-v12.patch, HBASE-14030-v13.patch, HBASE-14030-v14.patch, HBASE-14030-v15.patch, HBASE-14030-v17.patch, HBASE-14030-v18.patch, HBASE-14030-v2.patch, HBASE-14030-v20.patch, HBASE-14030-v21.patch, HBASE-14030-v22.patch, HBASE-14030-v23.patch, HBASE-14030-v24.patch, HBASE-14030-v25.patch, HBASE-14030-v3.patch, HBASE-14030-v4.patch, HBASE-14030-v5.patch, HBASE-14030-v6.patch, HBASE-14030-v7.patch, HBASE-14030-v8.patch
>
>
> This is the umbrella ticket for Backup/Restore Phase 1. See the HBASE-7912 design doc for the phase description.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)