hbase-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-3596) [replication] Wait a few seconds before transferring queues
Date Thu, 17 Mar 2011 10:34:31 GMT

    [ https://issues.apache.org/jira/browse/HBASE-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13007878#comment-13007878 ]

Hudson commented on HBASE-3596:
-------------------------------

Integrated in HBase-TRUNK #1792 (See [https://hudson.apache.org/hudson/job/HBase-TRUNK/1792/])
    

> [replication] Wait a few seconds before transferring queues 
> ------------------------------------------------------------
>
>                 Key: HBASE-3596
>                 URL: https://issues.apache.org/jira/browse/HBASE-3596
>             Project: HBase
>          Issue Type: Improvement
>    Affects Versions: 0.90.1
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.90.2
>
>         Attachments: HBASE-3596.patch
>
>
> ReplicationSourceManager.transferQueues is running a little too fast at the moment, and
> this has the bad side effect of making us run into HBASE-2611 at almost every cluster restart.
> The reason is that some servers might shut down faster than others, so the last region servers
> to be notified will at the same time see their friends dying and will try to pick up their queues.
> What happens then is that they also get told to shut down and might be able to close their
> ZK session before the queue transfer process is completed, which is what HBASE-2611 is about.
> Currently the only fix for that is to delete the lock znode by hand and bounce a region
> server so that it picks up the queue on startup.
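
The idea in the issue title (wait a few seconds before transferring queues) can be illustrated with a minimal Java sketch. This is not the code from HBASE-3596.patch: the class name, the delay parameter, and the "is this server stopping?" check are all hypothetical and stand in for whatever the real ReplicationSourceManager does. The sketch only shows the pattern of sleeping first and then re-checking local shutdown state, so that a server which has itself been told to stop never starts a transfer it cannot finish.

```java
import java.util.function.BooleanSupplier;

/**
 * Hypothetical sketch of a delayed queue transfer.
 * Not taken from HBASE-3596.patch; all names here are assumptions.
 */
final class DelayedQueueTransfer {
    private final long delayMs;              // how long to wait before acting (assumed to be configurable)
    private final BooleanSupplier stopping;  // stand-in for "has this region server been told to stop?"
    private final Runnable transfer;         // stand-in for the actual queue transfer work

    DelayedQueueTransfer(long delayMs, BooleanSupplier stopping, Runnable transfer) {
        this.delayMs = delayMs;
        this.stopping = stopping;
        this.transfer = transfer;
    }

    /** Returns true if the transfer ran, false if it was skipped. */
    boolean run() {
        try {
            // Give a cluster-wide shutdown time to reach this server too.
            Thread.sleep(delayMs);
        } catch (InterruptedException e) {
            // Preserve the interrupt status and give up on the transfer.
            Thread.currentThread().interrupt();
            return false;
        }
        // Re-check after the wait: if we are going down as well, do not start
        // a transfer that our own session close could cut short.
        if (stopping.getAsBoolean()) {
            return false;
        }
        transfer.run();
        return true;
    }
}
```

With a delay of a few seconds, a server that learns of its own shutdown during the wait skips the transfer, leaving the dead server's queue untouched for a surviving server to claim.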

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
