Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 91B4C9860 for ; Wed, 22 Feb 2012 20:02:15 +0000 (UTC) Received: (qmail 34634 invoked by uid 500); 22 Feb 2012 20:02:15 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 34519 invoked by uid 500); 22 Feb 2012 20:02:15 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 34497 invoked by uid 99); 22 Feb 2012 20:02:15 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Feb 2012 20:02:15 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Feb 2012 20:02:14 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 0D6AA335872 for ; Wed, 22 Feb 2012 20:01:54 +0000 (UTC) Date: Wed, 22 Feb 2012 20:01:54 +0000 (UTC) From: "Lars Hofhansl (Commented) (JIRA)" To: issues@hbase.apache.org Message-ID: <1640726298.5886.1329940914056.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1948061249.10800.1319038150893.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HBASE-4618) HBase backups MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13213941#comment-13213941 ] Lars Hofhansl commented on HBASE-4618: -------------------------------------- Are you planning to release the various tools you use as open source? At Salesforce we need to get started seriously on backup procedures and I would like to avoid a lot of duplicate work. > HBase backups > ------------- > > Key: HBASE-4618 > URL: https://issues.apache.org/jira/browse/HBASE-4618 > Project: HBase > Issue Type: Umbrella > Components: documentation, regionserver > Reporter: Karthik Ranganathan > Assignee: Karthik Ranganathan > > We have been working on the ability to do backups in HBase with different levels of protection. This is an umbrella task for all the backup related changes. Here are some kinds of changes - will create separate issues for them: > Roughly here are a few flavors of backups giving increasing levels of guarentees: > 1. Per cf backups > 2. Multi-cf backups with row atomicity preserved > 3. Multi-cf backups with row atomicity and point in time recovery. > On the perf dimension, here is a list of improvements: > 1. Copy the files - regular hadoop "cp" > 2. Use fast copy - copy blocks and stitch them together, saves top of rack bandwidth > 3. Use fast copy with hard links - no file copy, it does only ext3 level linking. > On the durability of data side: > 1. Ability to backup data onto the same racks as those running HBase > 2. Intra-datacenter backup > 3. Inter datacenter backup > Restores: > 1. Restore with a table name different from the backed up table name > 2. Restore a backed up table wen HBase cluster is not running at restore time > 3. Restore into a live and running cluster > Operationally: > 1. How to setup backups in live cluster > 2. Setting up intra-DC > 3. cross-DC backups > 4. Verifying a backup is good -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira