Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B1EF3200D16 for ; Tue, 10 Oct 2017 16:48:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B05C6160BE1; Tue, 10 Oct 2017 14:48:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 09B05160BE0 for ; Tue, 10 Oct 2017 16:48:04 +0200 (CEST) Received: (qmail 35159 invoked by uid 500); 10 Oct 2017 14:48:04 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 35148 invoked by uid 99); 10 Oct 2017 14:48:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Oct 2017 14:48:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 70CCFD8DD4 for ; Tue, 10 Oct 2017 14:48:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id HdZ61E5dUAAS for ; Tue, 10 Oct 2017 14:48:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 1936560CE6 for ; Tue, 10 Oct 2017 14:48:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 5D18FE02C7 for ; Tue, 10 Oct 2017 14:48:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 1690024360 for ; Tue, 10 Oct 2017 14:48:00 +0000 (UTC) Date: Tue, 10 Oct 2017 14:48:00 +0000 (UTC) From: "Mike Drob (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-18693) adding an option to restore_snapshot to move mob files from archive dir to working dir MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 10 Oct 2017 14:48:05 -0000 [ https://issues.apache.org/jira/browse/HBASE-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16198767#comment-16198767 ] Mike Drob commented on HBASE-18693: ----------------------------------- The rubocop & ruby-lint warnings look fine to ignore for now. We'll need to do a major cleanup pass later on line length anyway. > adding an option to restore_snapshot to move mob files from archive dir to working dir > -------------------------------------------------------------------------------------- > > Key: HBASE-18693 > URL: https://issues.apache.org/jira/browse/HBASE-18693 > Project: HBase > Issue Type: Improvement > Components: mob > Affects Versions: 2.0.0-alpha-2 > Reporter: huaxiang sun > Assignee: huaxiang sun > Attachments: HBASE-18693.master.001.patch > > > Today, there is a single mob region where mob files for all user regions are saved. There could be many files (one million) in a single mob directory. When one mob table is restored or cloned from snapshot, links are created for these mob files. This creates a scaling issue for mob compaction. In mob compaction's select() logic, for each hFileLink, it needs to call NN's getFileStatus() to get the size of the linked hfile. Assume that one such call takes 20ms, 20ms * 1000000 = 6 hours. > To avoid this overhead, we want to add an option so that restore_snapshot can move mob files from archive dir to working dir. clone_snapshot is more complicated as it can clone a snapshot to a different table so moving that can destroy the snapshot. No option will be added for clone_snapshot. -- This message was sent by Atlassian JIRA (v6.4.14#64029)