hbase-dev mailing list archives

From "Nitay Joffe (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HBASE-1311) ZooKeeperWrapper: Failed to set watcher on ZNode /hbase/master
Date Fri, 08 May 2009 19:22:45 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nitay Joffe updated HBASE-1311:
-------------------------------

    Attachment: hbase-1311.patch

Here's what I've got so far.

In this patch:

ZooKeeper is the source of truth, so if we lose our connection to it,
everyone else assumes we're gone. We should act accordingly, which means
aborting and restarting.
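
As a rough illustration of that policy (not the code in the patch), the sketch below treats session expiry as fatal and triggers a single abort-and-restart callback. SessionExpiryHandler, SessionState and onStateChange() are hypothetical stand-ins so the example compiles without the ZooKeeper jar; in the real client, expiry is delivered to a Watcher as KeeperState.Expired.

```java
// Hypothetical stand-in types; nothing here is taken from hbase-1311.patch.
class SessionExpiryHandler {
    // Simplified model of the session states a ZooKeeper client can report.
    enum SessionState { CONNECTED, DISCONNECTED, EXPIRED }

    private final Runnable abortAndRestart;
    private int restarts;

    SessionExpiryHandler(Runnable abortAndRestart) {
        this.abortAndRestart = abortAndRestart;
    }

    // A plain disconnect may still recover; an expired session cannot,
    // so only EXPIRED triggers the abort-and-restart path.
    void onStateChange(SessionState state) {
        if (state == SessionState.EXPIRED) {
            restarts++;
            abortAndRestart.run();
        }
    }

    int restartCount() {
        return restarts;
    }
}
```

The point of the split is that a disconnect is left to the client library to retry, while expiry means our ephemeral nodes are already gone and the rest of the cluster has moved on.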

I moved all of the state that has to be reinitialized into a new reinitialize()
method that is called by the constructor and my restart() method. It's rather
unfortunate that most of the things ended up moving into here (a thread can't
be started twice), so a lot of stuff is not final anymore.
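
To show the shape of this (a minimal sketch, not the patch itself), here is a toy server whose worker thread is rebuilt in reinitialize(). RestartableServer and everything in it except the reinitialize()/restart() names is made up for illustration.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical toy server; only the reinitialize()/restart() split mirrors the description.
class RestartableServer {
    // Not final: a Thread object cannot be started a second time,
    // so every restart needs a fresh instance.
    private Thread worker;
    private volatile boolean stopRequested;
    private final AtomicInteger generation = new AtomicInteger();

    RestartableServer() {
        reinitialize();
    }

    // Rebuilds all per-run state; called by the constructor and by restart().
    private void reinitialize() {
        stopRequested = false;
        int gen = generation.incrementAndGet();
        worker = new Thread(() -> {
            while (!stopRequested) {
                try {
                    Thread.sleep(10);
                } catch (InterruptedException e) {
                    return; // interrupted during shutdown
                }
            }
        }, "worker-" + gen);
        worker.setDaemon(true);
    }

    void start() {
        worker.start();
    }

    // Stops the current worker, rebuilds state, and starts a new worker.
    void restart() {
        stopRequested = true;
        worker.interrupt();
        try {
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        reinitialize();
        worker.start();
    }

    String workerName() {
        return worker.getName();
    }

    boolean isRunning() {
        return worker.isAlive();
    }
}
```

The cost is visible in the field declarations: anything that has to be recreated on restart loses its final modifier.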

I was seeing a problem with shutting down HDFS and starting it back up again,
so I added an AtomicBoolean to prevent the HDFS shutdown hook from running when
I restart.
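
The guard can be sketched roughly like this. GuardedShutdownHook and its cleanup callback are hypothetical names; how the patch actually wires the flag into the HDFS shutdown hook is not shown here.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical wrapper: skips the cleanup action while a restart is in progress.
class GuardedShutdownHook implements Runnable {
    private final AtomicBoolean restarting = new AtomicBoolean(false);
    private final Runnable cleanup;

    GuardedShutdownHook(Runnable cleanup) {
        this.cleanup = cleanup;
    }

    // Set to true before restarting and back to false afterwards.
    void setRestarting(boolean value) {
        restarting.set(value);
    }

    @Override
    public void run() {
        if (restarting.get()) {
            return; // restart in progress: leave the filesystem open
        }
        cleanup.run();
    }
}
```

Such a hook could be registered with Runtime.getRuntime().addShutdownHook(new Thread(hook)), with setRestarting(true) called just before the restart begins.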


I think I am now seeing the problem reported in HBASE-1362 when running the test in this patch.

> ZooKeeperWrapper: Failed to set watcher on ZNode /hbase/master
> --------------------------------------------------------------
>
>                 Key: HBASE-1311
>                 URL: https://issues.apache.org/jira/browse/HBASE-1311
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>            Assignee: Nitay Joffe
>             Fix For: 0.20.0
>
>         Attachments: dumb-wrapper-hack.patch, hbase-1311.patch
>
>
> After about 12 hours of operation, this repeats over and over in the regionserver log:
> 2009-04-05 19:44:38,445 WARN org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper: Failed to set watcher on ZNode /hbase/master
> org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
> 	at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:709)
> 	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.watchMasterAddress(ZooKeeperWrapper.java:235)
> 	at org.apache.hadoop.hbase.regionserver.HRegionServer.watchMasterAddress(HRegionServer.java:343)
> 	at org.apache.hadoop.hbase.regionserver.HRegionServer.process(HRegionServer.java:339)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:366)
> 2009-04-05 19:44:38,445 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to set watcher on ZooKeeper master address. Retrying.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

