hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HBASE-2441) ZK failures early in RS startup sequence cause infinite busy loop
Date Wed, 14 Apr 2010 01:33:50 GMT

     [ https://issues.apache.org/jira/browse/HBASE-2441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Todd Lipcon updated HBASE-2441:
-------------------------------

    Attachment: hbase-2441.txt

Here's a patch that fixes this. Would appreciate if someone's able to reproduce - not sure
how to write a test plan for it and haven't been able to reproduce it quite yet. The trick
is to get a ZK session and have it expire before reportForDuty is called (perhaps by making
the RS retry a bunch of times to talk to the master)

> ZK failures early in RS startup sequence cause infinite busy loop
> -----------------------------------------------------------------
>
>                 Key: HBASE-2441
>                 URL: https://issues.apache.org/jira/browse/HBASE-2441
>             Project: Hadoop HBase
>          Issue Type: Bug
>    Affects Versions: 0.20.3
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>         Attachments: hbase-2441.txt
>
>
> If the RS loses its ZK session before it reports for duty, the abort() call will trigger
an NPE, and then the stop boolean doesn't get toggled. The RS will then loop forever trying
to register itself in the expired ZK session, and fill up the logs.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message