hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 代志远 (Commented) (JIRA) <j...@apache.org>
Subject [jira] [Commented] (HBASE-5075) regionserver crashed,and failover
Date Wed, 08 Feb 2012 09:18:59 GMT

    [ https://issues.apache.org/jira/browse/HBASE-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13203405#comment-13203405
] 

代志远 commented on HBASE-5075:
----------------------------

hbase is a online db,it's availability is very important.
if some regionserver carshed ,we can't wait too long to recovery service.
so my patch is to improve the ability of failover in hbase.

i come from alipay in china, we have some hbase cluster online, we have the second big cluster
in china.
                
> regionserver crashed,and failover
> ---------------------------------
>
>                 Key: HBASE-5075
>                 URL: https://issues.apache.org/jira/browse/HBASE-5075
>             Project: HBase
>          Issue Type: Improvement
>          Components: monitoring, regionserver, replication, zookeeper
>    Affects Versions: 0.90.4
>            Reporter: 代志远
>             Fix For: 0.92.1
>
>
> regionserver crashed,it is too long time to notify hmaster.when hmaster know regionserver's
shutdown,it is long time to fetch the hlog's lease.
> hbase is a online db,availability is very important.
> i have a idea to improve availability,mintor node to check regionserver's pid.if this
pid notexsits,i think the rs down,i will delete the znode,and force close the hlog file.
> so the period maybe 100ms.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message