Date: Mon, 27 Jul 2015 20:10:04 +0000 (UTC)
From: "santosh kumar (JIRA)"
To: issues@hbase.apache.org
Subject: [jira] [Commented] (HBASE-14128) Fix inability to run Multiple MR over the same Snapshot

    [ https://issues.apache.org/jira/browse/HBASE-14128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14643306#comment-14643306 ]

santosh kumar commented on HBASE-14128:
---------------------------------------

Noob here, so could you look at my questions:
1. What version is this targeted for?
2. I looked at the code and found that this method is used by MultiTableSnapshotInputFormat and TableSnapshotInputFormat, and for those two I can understand the usage. However, ScanPerformanceEvaluation uses it as well, and since that is a test class (it lives in src/test), I don't understand its usage.
3. What would the target table name be based on? A timestamp, to prevent duplicates?
4. Is the assumption that the underlying HBase table doesn't exist when TableMapReduceUtil.initTableSnapshotMapperJob attempts to run this restore? Would it be better to pass the 'target' table as a parameter instead?

> Fix inability to run Multiple MR over the same Snapshot
> -------------------------------------------------------
>
>                 Key: HBASE-14128
>                 URL: https://issues.apache.org/jira/browse/HBASE-14128
>             Project: HBase
>          Issue Type: Bug
>          Components: mapreduce, snapshots
>            Reporter: Matteo Bertozzi
>            Assignee: santosh kumar
>            Priority: Minor
>              Labels: beginner, noob
>
> from the list, running multiple MR over the same snapshot does not work
> {code}
> public static void copySnapshotForScanner(Configuration conf, FileSystem ..
>     RestoreSnapshotHelper helper = new RestoreSnapshotHelper(conf, fs,
>       manifest, manifest.getTableDescriptor(), restoreDir, monitor, status);
> {code}
> the problem is that manifest.getTableDescriptor() will try to clone the snapshot with the same target name. ending up in "file already exist" exceptions.
> we just need to clone that descriptor and generate a new target table name
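
To make question 3 concrete, here is a minimal sketch of one way a per-job target name could be generated so that two jobs over the same snapshot do not collide. It uses a random UUID rather than a timestamp, on the assumption that two jobs might start within the same clock tick. The class `UniqueRestoreName` and the method `uniqueTargetName` are hypothetical names for illustration only; they are not part of HBase, and the sketch does not touch RestoreSnapshotHelper or the table descriptor.

```java
import java.util.UUID;

public class UniqueRestoreName {
    /**
     * Hypothetical helper, not an HBase API: derives a per-job target name
     * by appending a random UUID to the snapshot's original table name.
     */
    public static String uniqueTargetName(String originalTableName) {
        return originalTableName + "-" + UUID.randomUUID();
    }

    public static void main(String[] args) {
        // Two jobs restoring the same snapshot each get their own target name.
        String first = uniqueTargetName("mytable");
        String second = uniqueTargetName("mytable");
        System.out.println(first);
        System.out.println(second);
    }
}
```

If something like this were adopted, the generated name would presumably be set on a cloned copy of the descriptor before it is handed to the restore, which is the step the issue description asks for. Whether the caller should instead pass the target in as a parameter (question 4) is a separate design choice.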