hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "binlijin (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (HBASE-19290) Reduce zk request when doing split log
Date Thu, 23 Nov 2017 03:28:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-19290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16263748#comment-16263748
] 

binlijin edited comment on HBASE-19290 at 11/23/17 3:27 AM:
------------------------------------------------------------

bq. The above example lasted for almost an hour. 
bq.  With the patch, roughly how long does log splitting task last ?
We do not record the new numbers and the log do not exists now. But we record that we have
7.1TB wals and split it in 40mins.



was (Author: aoxiang):
bq. The above example lasted for almost an hour. 
      With the patch, roughly how long does log splitting task last ?
We do not record the new numbers and the log do not exists now. But we record that we have
7.1TB wals and split it in 40mins.


> Reduce zk request when doing split log
> --------------------------------------
>
>                 Key: HBASE-19290
>                 URL: https://issues.apache.org/jira/browse/HBASE-19290
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: binlijin
>            Assignee: binlijin
>         Attachments: HBASE-19290.master.001.patch, HBASE-19290.master.002.patch, HBASE-19290.master.003.patch,
HBASE-19290.master.004.patch
>
>
> We observe once the cluster has 1000+ nodes and when hundreds of nodes abort and doing
split log, the split is very very slow, and we find the regionserver and master wait on the
zookeeper response, so we need to reduce zookeeper request and pressure for big cluster.
> (1) Reduce request to rsZNode, every time calculateAvailableSplitters will get rsZNode's
children from zookeeper, when cluster is huge, this is heavy. This patch reduce the request.

> (2) When the regionserver has max split tasks running, it may still trying to grab task
and issue zookeeper request, we should sleep and wait until we can grab tasks again.  



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message