hbase-user mailing list archives

From Stack <st...@duboce.net>
Subject Re: Regionserver is crashed frequently these days
Date Fri, 01 Apr 2011 16:17:08 GMT
On Fri, Apr 1, 2011 at 9:01 AM, 陈加俊 <cjjvictory@gmail.com> wrote:
> 2011-04-01 19:13:40,413 WARN org.apache.hadoop.hbase.regionserver.Store:
> Failed open of hdfs://
> master.uc.uuwatch.com:9000/hbase/cjjHTML/1494733632/page/5173469199902346167.1864097884;
> presumption is that file was corrupted at flush and lost edits picked up by
> commit log replay. Verify!
> java.io.IOException: Cannot open filename
> /hbase/cjjHTML/1864097884/page/5173469199902346167
> ......
>

This is a case where a daughter region is unable to open its parent
region's storefile (the daughter refers to parent storefiles for a
period of time after initial open).  Look at what happened to the
parent region.  Was it prematurely removed?
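
To see which parent file the daughter is after, note that the reference
file name in the log is the parent storefile id, a dot, then the parent
region's encoded name. A minimal sketch of that mapping follows; the
helper name parent_storefile is made up for illustration, and it assumes
the /hbase/<table>/<region>/<family>/<file> layout seen in the log above.

```python
import posixpath


def parent_storefile(reference_path):
    """Map a daughter's reference file to the parent storefile it points at.

    Hypothetical helper: assumes the reference file is named
    <storefile id>.<parent encoded region name>, as in the log above.
    """
    store_dir, ref_name = posixpath.split(reference_path)
    hfile_id, sep, parent_region = ref_name.partition(".")
    if not sep or not parent_region:
        raise ValueError("not a reference file name: " + ref_name)
    # store_dir is <table dir>/<daughter region>/<family>
    region_dir, family = posixpath.split(store_dir)
    table_dir = posixpath.dirname(region_dir)
    return posixpath.join(table_dir, parent_region, family, hfile_id)


ref = "/hbase/cjjHTML/1494733632/page/5173469199902346167.1864097884"
print(parent_storefile(ref))
# /hbase/cjjHTML/1864097884/page/5173469199902346167
```

That is the same path the IOException complains about, so the thing to
check is whether /hbase/cjjHTML/1864097884 still exists, for example
with hadoop fs -ls.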

> 2011-04-01 19:17:22,716 INFO
> org.apache.hadoop.hbase.regionserver.HRegionServer: MSG_REGION_CLOSE:
> cjjHTML,http://news.ifeng.com/gundong/detail_2011_03/15/515
> 4913_0.shtml,1300245193111: Overloaded
> 2011-04-01 19:17:22,716 INFO

This we've discussed.

> 2011-04-01 22:01:49,212 WARN org.apache.zookeeper.ClientCnxn: Exception
> closing session 0x942f0f7ae13d0000 to sun.nio.ch.SelectionKeyImpl@34819c89
> java.io.IOException: TIMED OUT
>        at


This looks like a straight session timeout against ZK.  Long GC pause?
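
One rough way to look for a pause is to scan the regionserver log for
stretches where nothing was logged at all. The sketch below assumes the
log4j timestamp format seen in the excerpts above; the find_gaps name,
the 60 second threshold, and the first two sample lines are made up for
illustration. Compare against whatever zookeeper.session.timeout is set
to in your hbase-site.xml. A silent stretch is only a hint; GC logging
on the regionserver JVM is what would confirm it.

```python
import re
from datetime import datetime

# Matches a leading log4j timestamp such as "2011-04-01 22:01:49,212".
STAMP = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),(\d{3})")


def find_gaps(lines, min_gap_seconds):
    """Return (before, after, seconds) for each silent stretch in the log."""
    gaps = []
    prev = None
    for line in lines:
        m = STAMP.match(line)
        if not m:
            continue  # stack trace or other continuation line
        ts = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S")
        ts = ts.replace(microsecond=int(m.group(2)) * 1000)
        if prev is not None:
            gap = (ts - prev).total_seconds()
            if gap >= min_gap_seconds:
                gaps.append((prev, ts, gap))
        prev = ts
    return gaps


# Illustrative input only: the first two lines are invented, the last
# one reuses the timestamp from the ZK warning quoted above.
sample = [
    "2011-04-01 22:00:40,000 INFO example: made-up line before the stall",
    "java.io.IOException: continuation line without a timestamp",
    "2011-04-01 22:01:49,212 WARN org.apache.zookeeper.ClientCnxn: Exception closing session",
]

for before, after, seconds in find_gaps(sample, min_gap_seconds=60):
    print("nothing logged between", before, "and", after, "gap:", seconds, "s")
```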

St.Ack
