hbase-issues mailing list archives

From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HBASE-20056) Performance optimization on MultiTableInputFormatBase#getSplits()
Date Wed, 28 Feb 2018 16:39:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-20056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16380639#comment-16380639 ]

Ted Yu edited comment on HBASE-20056 at 2/28/18 4:38 PM:
---------------------------------------------------------

Thanks for the patch, Yechao

Thanks for the review, Chiaping.


was (Author: yuzhihong@gmail.com):
Thanks for the patch, Yehao

Thanks for the review, Chiaping.

> Performance optimization on MultiTableInputFormatBase#getSplits() 
> ------------------------------------------------------------------
>
>                 Key: HBASE-20056
>                 URL: https://issues.apache.org/jira/browse/HBASE-20056
>             Project: HBase
>          Issue Type: Bug
>          Components: hbase, mapreduce
>    Affects Versions: 1.0.1, 1.3.1, 1.2.6
>            Reporter: ShivaKumar SS
>            Assignee: Yechao Chen
>            Priority: Minor
>              Labels: hbase, mapreduce, performance
>             Fix For: 1.2.7
>
>         Attachments: HBASE-20056.branch-1.2.patch
>
>
> Currently this method iterates over the list of Scan objects to compute splits, and on each
> iteration it opens an HConnection and then closes it, which is expensive.
> It can be optimized so that a single HConnection is used to compute the splits
> for all of the Scan objects.
> This optimization will help reduce the launch time of the MR job.
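
The idea in the description can be illustrated with a minimal, self-contained sketch. It is not the attached patch and does not use the HBase API: FakeConnection, splitsFor, and both method names are hypothetical stand-ins, and scans are represented as plain strings. The sketch only contrasts opening one connection per scan with reusing one connection for every scan, counting how many connections each approach opens.

```java
import java.util.ArrayList;
import java.util.List;

public class SharedConnectionSplits {

    // Hypothetical stand-in for an expensive-to-open connection (not an HBase class).
    public static final class FakeConnection implements AutoCloseable {
        public static int opened = 0;   // counts how many connections were created

        public FakeConnection() {
            opened++;
        }

        // Pretend every scan yields two splits; real split computation is out of scope.
        public List<String> splitsFor(String scan) {
            List<String> out = new ArrayList<>();
            out.add(scan + "#split-0");
            out.add(scan + "#split-1");
            return out;
        }

        @Override
        public void close() {
            // Nothing to release in this sketch.
        }
    }

    // Pattern described as the current behaviour: open and close a connection per scan.
    public static List<String> splitsPerScanConnection(List<String> scans) {
        List<String> splits = new ArrayList<>();
        for (String scan : scans) {
            try (FakeConnection conn = new FakeConnection()) {
                splits.addAll(conn.splitsFor(scan));
            }
        }
        return splits;
    }

    // Proposed pattern: open one connection, reuse it for every scan, close it once.
    public static List<String> splitsSharedConnection(List<String> scans) {
        List<String> splits = new ArrayList<>();
        try (FakeConnection conn = new FakeConnection()) {
            for (String scan : scans) {
                splits.addAll(conn.splitsFor(scan));
            }
        }
        return splits;
    }
}
```

Both methods return the same splits; the difference is that the connection count grows with the number of scans in the first and stays at one in the second, which is where the reduction in job launch time would be expected to come from.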



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
