hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ryan rawson (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-3142) If a master dies and comes back up before his znode expires, the RS heartbeat can lock up
Date Mon, 08 Nov 2010 23:44:07 GMT

    [ https://issues.apache.org/jira/browse/HBASE-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12929820#action_12929820
] 

ryan rawson commented on HBASE-3142:
------------------------------------

did the rs not recover when the master started?

> If a master dies and comes back up before his znode expires, the RS heartbeat can lock
up
> -----------------------------------------------------------------------------------------
>
>                 Key: HBASE-3142
>                 URL: https://issues.apache.org/jira/browse/HBASE-3142
>             Project: HBase
>          Issue Type: Bug
>          Components: master, regionserver
>    Affects Versions: 0.89.20100924, 0.90.0
>            Reporter: Jonathan Gray
>            Assignee: ryan rawson
>            Priority: Critical
>             Fix For: 0.90.0
>
>
> During a rolling restart, we ran into a case where a master was shutdown and then brought
back up before the znode expired.
> On the RS side, while the master was down, it was getting ConnectionRefused exceptions
trying to heartbeat to what it thinks is the active master.
> Once the master process comes back up, the next heartbeat done by all the RSs just blocks
indefinitely.
> This is somewhat related to HBASE-3141

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message