From: Ted Yu
To: dev@hbase.apache.org
Date: Fri, 14 Jan 2011 15:04:03 -0800
Subject: YouAreDeadException

I ran 0.90 RC3 in a dev cluster and saw the following in the region server log:

Caused by: org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378 as dead server
        at org.apache.hadoop.hbase.master.ServerManager.checkIsDead(ServerManager.java:197)
        at org.apache.hadoop.hbase.master.ServerManager.regionServerReport(ServerManager.java:247)
        at org.apache.hadoop.hbase.master.HMaster.regionServerReport(HMaster.java:648)
        at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.hbase.ipc.HBaseRPC$Server.call(HBaseRPC.java:570)
        at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1036)
        at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:753)
        at org.apache.hadoop.hbase.ipc.HBaseRPC$Invoker.invoke(HBaseRPC.java:257)
        at $Proxy0.regionServerReport(Unknown Source)
        at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:702)
        ... 2 more
2011-01-13 03:55:08,982 INFO org.apache.zookeeper.ZooKeeper: Initiating client connection, connectString=sjc1-hadoop0.sjc1.carrieriq.com:2181 sessionTimeout=90000 watcher=hconnection
2011-01-13 03:55:08,914 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server serverName=sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378, load=(requests=0, regions=6, usedHeap=514, maxHeap=3983): regionserver:60020-0x12d7b7b1c760004 regionserver:60020-0x12d7b7b1c760004 received expired from ZooKeeper, aborting
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.connectionEvent(ZooKeeperWatcher.java:328)
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:246)
        at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530)
        at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506)

---------------

And the following from the master log:

2011-01-13 03:52:42,003 INFO org.apache.hadoop.hbase.zookeeper.RegionServerTracker: RegionServer ephemeral node deleted, processing expiration [sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378]
2011-01-13 03:52:42,005 DEBUG org.apache.hadoop.hbase.master.ServerManager: Added=sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378 to dead servers, submitted shutdown handler to be executed, root=false, meta=false
2011-01-13 03:52:42,005 INFO org.apache.hadoop.hbase.master.handler.ServerShutdownHandler: Splitting logs for sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378
2011-01-13 03:52:42,092 INFO org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Splitting 1 hlog(s) in hdfs://sjc1-hadoop0.sjc1.carrieriq.com:9000/hbase/.logs/sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378
2011-01-13 03:52:42,093 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Writer thread Thread[WriterThread-0,5,main]: starting
2011-01-13 03:52:42,094 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Writer thread Thread[WriterThread-1,5,main]: starting
2011-01-13 03:52:42,096 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLogSplitter: Splitting hlog 1 of 1: hdfs://sjc1-hadoop0.sjc1.carrieriq.com:9000/hbase/.logs/sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378/sjc1-hadoop1.sjc1.carrieriq.com%3A60020.1294860449407, length=0

Please advise what could be the cause.

Thanks