hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-13200) Improper configuration can leads to endless lease recovery during failover
Date Wed, 11 Mar 2015 23:14:38 GMT

    [ https://issues.apache.org/jira/browse/HBASE-13200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14357780#comment-14357780
] 

stack commented on HBASE-13200:
-------------------------------

Patch lgtm. Have you tried it on a cluster or in prod [~heliangliang]? Thanks.

> Improper configuration can leads to endless lease recovery during failover
> --------------------------------------------------------------------------
>
>                 Key: HBASE-13200
>                 URL: https://issues.apache.org/jira/browse/HBASE-13200
>             Project: HBase
>          Issue Type: Bug
>          Components: MTTR
>            Reporter: He Liangliang
>            Assignee: He Liangliang
>         Attachments: HBASE-13200.patch
>
>
> When a node (DN+RS) has machine/OS level failure, another RS will try to do lease recovery
for the log file. It will retry for every hbase.lease.recovery.dfs.timeout (default to 61s)
from the second time. When the hdfs configuration is not properly configured (e.g. socket
connection timeout) and without patch HDFS-4721, the lease recovery time can exceeded the
timeout specified by hbase.lease.recovery.dfs.timeout. This will lead to  endless retries
and preemptions until the final timeout.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message