hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashu Pachauri (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18090) Improve TableSnapshotInputFormat to allow more multiple mappers per region
Date Wed, 27 Sep 2017 01:22:01 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16181839#comment-16181839
] 

Ashu Pachauri commented on HBASE-18090:
---------------------------------------

Just noticed that HBASE-16894 is about adding this support for the TableFormat. Somehow I
missed the fact that it's linked to this jira. You can ignore my comment regarding adding
the support for TableInputFormat.

> Improve TableSnapshotInputFormat to allow more multiple mappers per region
> --------------------------------------------------------------------------
>
>                 Key: HBASE-18090
>                 URL: https://issues.apache.org/jira/browse/HBASE-18090
>             Project: HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 1.4.0
>            Reporter: Mikhail Antonov
>            Assignee: xinxin fan
>         Attachments: HBASE-18090-branch-1.3-v1.patch, HBASE-18090-branch-1.3-v2.patch,
HBASE-18090-V3-master.patch, HBASE-18090-V4-master.patch
>
>
> TableSnapshotInputFormat runs one map task per region in the table snapshot. This places
unnecessary restriction that the region layout of the original table needs to take the processing
resources available to MR job into consideration. Allowing to run multiple mappers per region
(assuming reasonably even key distribution) would be useful.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message