Date: Fri, 11 Mar 2016 19:24:39 +0000 (UTC)
From: "Vladimir Rodionov (JIRA)"
To: dev@hbase.apache.org
Reply-To: dev@hbase.apache.org
Subject: [jira] [Resolved] (HBASE-15331) HBase Backup/Restore Phase 2: Optimized Restore operation

     [ https://issues.apache.org/jira/browse/HBASE-15331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vladimir Rodionov resolved HBASE-15331.
---------------------------------------
    Resolution: Fixed

The work continues in HBASE-15448.

> HBase Backup/Restore Phase 2: Optimized Restore operation
> ---------------------------------------------------------
>
>                 Key: HBASE-15331
>                 URL: https://issues.apache.org/jira/browse/HBASE-15331
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>
> The current implementation of restore uses a WALReplay M/R job. This has performance and stability problems, since it uses the HBase client API to insert data. We have to migrate to a bulk-load approach: generate HFiles directly from snapshot and incremental images.
> We run a separate M/R job for every backup image between the last FULL backup and the incremental backup we are restoring to, and for every table in the list (image). If we have 10 tables and 30 days of incremental backup images, this results in 30 x 10 = 300 M/R jobs. This MUST be optimized.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
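
The job-count arithmetic in the description can be sketched as follows. This is only an illustration: `RestoreJobCount` and both of its methods are hypothetical names, not part of HBase. The "one job per table" alternative is an assumption about what an optimized restore might look like, since the issue does not say how the count should be reduced.

```java
// Hypothetical helper, not part of HBase: counts the M/R jobs a restore launches.
final class RestoreJobCount {

    // Scheme described in the issue: one M/R job per (backup image, table) pair.
    static long perImagePerTable(int tables, int incrementalImages) {
        return (long) tables * incrementalImages;
    }

    // Assumed alternative (not stated in the issue): one M/R job per table,
    // reading all of that table's incremental images in a single pass.
    static long perTable(int tables) {
        return tables;
    }

    public static void main(String[] args) {
        int tables = 10;  // example from the issue
        int images = 30;  // 30 days of incremental backup images
        System.out.println("per image and table: " + perImagePerTable(tables, images));
        System.out.println("per table (assumed): " + perTable(tables));
    }
}
```

With the figures from the issue, the first method reproduces the 30 x 10 = 300 jobs. Under the assumed per-table grouping, the count no longer grows with the number of incremental images.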