hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] Reopened: (HBASE-1534) Got ZooKeeper event, state: Disconnected on HRS and then NPE on reinit
Date Wed, 29 Jul 2009 19:38:14 GMT

     [ https://issues.apache.org/jira/browse/HBASE-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

stack reopened HBASE-1534:
--------------------------


Just saw this on jgray cluster.  Looking at reportForDuty, looks like might be able to fall
through reportForDuty method and out into the init (if stopRequested had not been reset say?).

Here is jgray log:

{code}
2009-07-29 08:12:25,630 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Shutdown
thread complete
2009-07-29 08:12:25,630 INFO org.apache.hadoop.hbase.regionserver.MemStoreFlusher: globalMemStoreLimit=597.5m,
globalMemStoreLimitLowMark=298.8m, maxHeap=1.9g
2009-07-29 08:12:25,631 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Runs every
10000000ms
2009-07-29 08:12:25,637 INFO org.apache.zookeeper.ZooKeeper: Initiating client connection,
host=zk5:2181,zk4:2181,zk3:2181,zk2:2181,zk6:2181 sessionTimeout=60000 watcher=org.apache.hadoop.hbase.regionserver.HRegionServer@2c96cf11
2009-07-29 08:12:25,638 INFO org.apache.zookeeper.ClientCnxn: Attempting connection to server
zk3/XX.XX.XX.143:2181
2009-07-29 08:12:25,641 INFO org.apache.zookeeper.ClientCnxn: Priming connection to java.nio.channels.SocketChannel[connected
local=/XX.XX.XX.217:49697 remote=zk3/XX.XX.XX.143:2181]
2009-07-29 08:12:25,641 INFO org.apache.zookeeper.ClientCnxn: Server connection successful
2009-07-29 08:12:25,650 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down
2009-07-29 08:12:25,650 ERROR org.apache.hadoop.hbase.regionserver.HRegionServer: Failed init
java.lang.NullPointerException
	at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:705)
	at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:420)
	at java.lang.Thread.run(Thread.java:636)
2009-07-29 08:12:25,651 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: Unhandled
exception. Aborting...
java.io.IOException: Region server startup failed
	at org.apache.hadoop.hbase.regionserver.HRegionServer.convertThrowableToIOE(HRegionServer.java:831)
	at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:747)
	at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:420)
	at java.lang.Thread.run(Thread.java:636)
Caused by: java.lang.NullPointerException
	at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:705)
	... 2 more

{code}

Scenario was expired zk session.

> Got ZooKeeper event, state: Disconnected on HRS and then NPE on reinit
> ----------------------------------------------------------------------
>
>                 Key: HBASE-1534
>                 URL: https://issues.apache.org/jira/browse/HBASE-1534
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: Nitay Joffe
>             Fix For: 0.20.0
>
>         Attachments: hbase-1534-v2.patch, hbase-1534.patch
>
>
> We got disconnect from zk but then when we tried to reinitialize ourselves, got a NPE.
 See below.
> {code}
> 2009-06-17 11:58:55,102 [Thread-16] INFO org.apache.hadoop.hbase.regionserver.HRegionServer:
Starting shutdown thread. 
> 2009-06-17 11:58:55,102 [Thread-16] INFO org.apache.hadoop.hbase.regionserver.HRegionServer:
Shutdown thread complete
> 2009-06-17 11:58:55,102 [main-EventThread] INFO org.apache.hadoop.hbase.ipc.HBaseRpcMetrics:
Initializing RPC Metrics with hostName=HRegionServer, port=60021
> 2009-06-17 11:58:55,103 [main-EventThread] INFO org.apache.hadoop.hbase.regionserver.MemcacheFlusher:
globalMemcacheLimit=556.7m, globalMemcacheLimitLowMark=347.9m, maxHeap=1.4g
> 2009-06-17 11:58:55,103 [main-EventThread] INFO org.apache.hadoop.hbase.regionserver.HRegionServer:
Runs every 10000000ms
> 2009-06-17 11:58:55,148 [regionserver/0:0:0:0:0:0:0:0:60021] ERROR org.apache.hadoop.hbase.regionserver.HRegionServer:
Failed init
> java.lang.NullPointerException
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:713)
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:431)
>     at java.lang.Thread.run(Thread.java:619)
> 2009-06-17 11:58:55,153 [regionserver/0:0:0:0:0:0:0:0:60021] FATAL org.apache.hadoop.hbase.regionserver.HRegionServer:
Unhandled exception. Aborting...
> java.io.IOException: Region server startup failed
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.convertThrowableToIOE(HRegionServer.java:832)
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:751)
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:431)
>     at java.lang.Thread.run(Thread.java:619)
> Caused by: java.lang.NullPointerException
>     at org.apache.hadoop.hbase.regionserver.HRegionServer.init(HRegionServer.java:713)
>     ... 2 more   
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message