hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-17462) Investigate using sliding window for read/write request costs in StochasticLoadBalancer
Date Fri, 20 Jan 2017 01:51:26 GMT

    [ https://issues.apache.org/jira/browse/HBASE-17462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15831013#comment-15831013
] 

Ted Yu commented on HBASE-17462:
--------------------------------

Thinking about "hbase.master.balancer.stochastic.numRegionLoadsToRemember", the parameter
doesn't take into account the number of regions in the cluster.
Meaning, the amount of heap consumed by RegionLoads would be out of norm for cluster with
large number of regions.

> Investigate using sliding window for read/write request costs in StochasticLoadBalancer
> ---------------------------------------------------------------------------------------
>
>                 Key: HBASE-17462
>                 URL: https://issues.apache.org/jira/browse/HBASE-17462
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Ted Yu
>            Assignee: Tim Brown
>              Labels: patch
>         Attachments: after_changes.png, before_changes.png, HBASE-17462.patch
>
>
> In the thread, http://search-hadoop.com/m/HBase/YGbbyUZKXWALkX1, Timothy was asking whether
the read/write request costs in StochasticLoadBalancer should be calculated as rates.
> This makes sense since read / write load on region server tends to fluctuate over time.
Using sliding window would reflect more recent trend in read / write load.
> Some factors to consider:
> The data structure used by StochasticLoadBalancer should be concise. The
> number of regions in a cluster can be expected to approach 1 million. We
> cannot afford to store long history of read / write requests in master.
> Efficiency of cost calculation should be high - there're many cost
> functions the balancer goes through, it is expected for each cost function
> to return quickly. Otherwise we would not come up with proper region
> movement plan(s) in time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message