Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B6B43200D18 for ; Wed, 27 Sep 2017 07:52:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B52DA1609EB; Wed, 27 Sep 2017 05:52:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 066CC1609D7 for ; Wed, 27 Sep 2017 07:52:04 +0200 (CEST) Received: (qmail 73636 invoked by uid 500); 27 Sep 2017 05:52:04 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 73618 invoked by uid 99); 27 Sep 2017 05:52:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Sep 2017 05:52:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 822511A2E4C for ; Wed, 27 Sep 2017 05:52:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id b96kByuZ_1Vb for ; Wed, 27 Sep 2017 05:52:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 087CD60E46 for ; Wed, 27 Sep 2017 05:52:02 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id C6190E0F6F for ; Wed, 27 Sep 2017 05:52:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 37B6E242AB for ; Wed, 27 Sep 2017 05:52:00 +0000 (UTC) Date: Wed, 27 Sep 2017 05:52:00 +0000 (UTC) From: "Ashu Pachauri (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-18090) Improve TableSnapshotInputFormat to allow more multiple mappers per region MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 27 Sep 2017 05:52:05 -0000 [ https://issues.apache.org/jira/browse/HBASE-18090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16182026#comment-16182026 ] Ashu Pachauri commented on HBASE-18090: --------------------------------------- +1 on the patch V5. I'll commit it tomorrow if no one objects by then (given the tests pass) > Improve TableSnapshotInputFormat to allow more multiple mappers per region > -------------------------------------------------------------------------- > > Key: HBASE-18090 > URL: https://issues.apache.org/jira/browse/HBASE-18090 > Project: HBase > Issue Type: Improvement > Components: mapreduce > Affects Versions: 1.4.0 > Reporter: Mikhail Antonov > Assignee: xinxin fan > Attachments: HBASE-18090-branch-1.3-v1.patch, HBASE-18090-branch-1.3-v2.patch, HBASE-18090-V3-master.patch, HBASE-18090-V4-master.patch, HBASE-18090-V5-master.patch > > > TableSnapshotInputFormat runs one map task per region in the table snapshot. This places unnecessary restriction that the region layout of the original table needs to take the processing resources available to MR job into consideration. Allowing to run multiple mappers per region (assuming reasonably even key distribution) would be useful. -- This message was sent by Atlassian JIRA (v6.4.14#64029)