hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yu Li (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-14699) Replication crashes regionservers when hbase.wal.provider is set to multiwal
Date Wed, 28 Oct 2015 06:35:27 GMT

    [ https://issues.apache.org/jira/browse/HBASE-14699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977852#comment-14977852
] 

Yu Li commented on HBASE-14699:
-------------------------------

Hi [~ashu210890],
The issue of ReplicationManager#cleanOldLogs is already found and addressed by HBASE-6617
(refer to [this comment|https://issues.apache.org/jira/browse/HBASE-6617?focusedCommentId=14708924&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14708924]).
HBASE-6617 is already integrated into branch-1 and master, but not branch-1.2. I believe branch-1
won't have such issue, you could give it a try if possible.

[~busbey], feel free to let me know if would like to take HBASE-6617 into branch-1.2, I could
make a quick patch although there may be some rebase work :-)

> Replication crashes regionservers when hbase.wal.provider is set to multiwal
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-14699
>                 URL: https://issues.apache.org/jira/browse/HBASE-14699
>             Project: HBase
>          Issue Type: Bug
>          Components: Replication
>            Reporter: Ashu Pachauri
>            Assignee: Ashu Pachauri
>            Priority: Blocker
>
> When the hbase.wal.provider is set to multiwal and replication is enabled, the regionservers
start crashing with the following exception:
> {code}
> <hostname>,16020,1445495411258: Failed to write replication wal position (filename=<hostname>%2C16020%2C1445495411258.null0.1445495898373,
position=1322399)
> org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/replication/rs/<hostname>,16020,1445495411258/1/<hostname>%2C16020%2C1445495411258.null0.1445495898373
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:111)
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> 	at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1270)
> 	at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:429)
> 	at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:940)
> 	at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:990)
> 	at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:984)
> 	at org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:129)
> 	at org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:177)
> 	at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.run(ReplicationSource.java:388)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message