MIME-Version: 1.0
Date: Tue, 27 Mar 2012 09:54:59 -0700
Subject: Re: ArrayIndexOutOfBoundsException in 0.90.7-SNAPSHOT
From: Ted Yu
To: user@hbase.apache.org
Content-Type: multipart/alternative; boundary=f46d0442726ce9c51a04bc3c5992

--f46d0442726ce9c51a04bc3c5992
Content-Type: text/plain; charset=ISO-8859-1

Index 20 corresponds to RS_ZK_REGION_FAILED_OPEN which was added by:

HBASE-5490 Move the enum RS_ZK_REGION_FAILED_OPEN to the last of the enum
list in 0.90 EventHandler (Ram)

As of now, is there any server that is still running 0.90.4 ?
Such server(s) wouldn't be able to interpret RS_ZK_REGION_FAILED_OPEN
correctly.

Cheers

On Tue, Mar 27, 2012 at 9:07 AM, Daniel Lamberger <daniel.lamberger@gigya-inc.com> wrote:

> Hello,
>
> We recently migrated to 0.90.7-SNAPSHOT, and are encountering the above
> exception, which seems to fail various HBase operations.
>
> How it came to be:
>
> * We upgraded from 0.90.4 to 0.90.7, however not all slaves were restarted,
> i.e. we ran slaves from different versions for a couple of days.
>
> * We tried disabling a table and that operation locked up, with the
> following recurring errors in the log file:
>
> 2012-03-27 09:11:54,402 INFO org.apache.hadoop.hbase.master.AssignmentManager: Region has been PENDING_CLOSE for too long, running forced unassign again on region=...
> 2012-03-27 09:12:04,404 INFO org.apache.hadoop.hbase.master.AssignmentManager: Regions in transition timed out: ... state=PENDING_CLOSE, ts=1332853083660
>
> * We restarted the cluster and the table we previously tried to disable was
> now marked as disabled. When trying to re-enable it, the operation failed.
> The log:
>
> hbase.master.handler.EnableTableHandler: Attemping to enable the table api_status
> hbase.master.handler.EnableTableHandler: Table has 7 regions of which 7 are online.
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for 1572f94d627fe784eb5653d6f32378c8 with OFFLINE state
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for 2ebf932e9bf7c438db3144b892918d08 with OFFLINE state
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for 499064a3f5de2b6b11144c3f5d4c8060 with OFFLINE state
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for a9dea3db85a4219057ae71a79ad92c8c with OFFLINE state
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for 8373f4db3d61a8e2ea209b2fdebd4c33 with OFFLINE state
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Creating (or updating) unassigned node for 50d12796747cdc5c188589b6ed47d485 with OFFLINE state
> hbase.master.AssignmentManager: Handling transition=M_ZK_REGION_OFFLINE, server=hadoop1-hbm1.XXX, region=1572f94d627fe784eb5653d6f32378c8
> hbase.master.AssignmentManager: No previous transition plan was found (or we are ignoring an existing plan) for api_status,,1332438613488.1572f94d627fe784eb5653d6f32378c8. so generated a random one; hri=api_status,,1332438613488.1572f94d627fe784eb5653d6f32378c8., src=, dest=hadoop1-s02.XXX,60020,1332856612196; 10 (online=10, exclude=null) available servers
>
> hbase.master.AssignmentManager: Assigning region api_status,,1332438613488.1572f94d627fe784eb5653d6f32378c8. to hadoop1-s02.XXX,60020,1332856612196
> hbase.master.AssignmentManager: No previous transition plan was found (or we are ignoring an existing plan) for api_status,XXX,1332440559822.2ebf932e9bf7c438db3144b892918d08.
> so generated a random one; hri=api_status,XXX,1332440559822.2ebf932e9bf7c438db3144b892918d08., src=, dest=hadoop1-s02.XXX,60020,1332856612196; 10 (online=10, exclude=null) available servers
>
> (those couple of lines are repeated for the rest of the slaves)
>
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s05.XXX,60020,1332857174731, region=499064a3f5de2b6b11144c3f5d4c8060
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s05.XXX,60020,1332857174731, region=a9dea3db85a4219057ae71a79ad92c8c
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s02.XXX,60020,1332856612196, region=1572f94d627fe784eb5653d6f32378c8
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s05.XXX,60020,1332857174731, region=499064a3f5de2b6b11144c3f5d4c8060
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENED, server=hadoop1-s02.XXX,60020,1332856612196, region=2ebf932e9bf7c438db3144b892918d08
> hbase.master.handler.OpenedRegionHandler: Handling OPENED event for 2ebf932e9bf7c438db3144b892918d08; deleting unassigned node
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Deleting existing unassigned node for 2ebf932e9bf7c438db3144b892918d08 that is in expected state RS_ZK_REGION_OPENED
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s05.XXX,60020,1332857174731, region=a9dea3db85a4219057ae71a79ad92c8c
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s01.XXX,60020,1332856427347, region=8373f4db3d61a8e2ea209b2fdebd4c33
> hbase.zookeeper.ZKAssign: master:60000-0x23004a31d9083df Successfully deleted unassigned node for region 2ebf932e9bf7c438db3144b892918d08 in expected state RS_ZK_REGION_OPENED
> hbase.master.handler.OpenedRegionHandler: Opened region
> api_status,XXX,1332440559822.2ebf932e9bf7c438db3144b892918d08. on hadoop1-s02.XXX,60020,1332856612196
> hbase.master.AssignmentManager: Handling transition=RS_ZK_REGION_OPENING, server=hadoop1-s06.XXX,60020,1332857364660, region=50d12796747cdc5c188589b6ed47d485
> hbase.master.AssignmentManager: Regions in transition timed out: gs_users,5351402|fWTtXMEa2WXHOo01esutJA==,1330321577716.01f70fbfd1a6b6582c4c4c2c814fb3ed. state=OPENING, ts=1332857787876
> hbase.master.AssignmentManager: Region has been OPENING for too long, reassigning region=gs_users,5351402|fWTtXMEa2WXHOo01esutJA==,1330321577716.01f70fbfd1a6b6582c4c4c2c814fb3ed.
> ERROR org.apache.hadoop.hbase.master.AssignmentManager$TimeoutMonitor: Caught exception
> java.lang.ArrayIndexOutOfBoundsException: 20
>     at org.apache.hadoop.hbase.executor.RegionTransitionData.readFields(RegionTransitionData.java:148)
>     at org.apache.hadoop.hbase.util.Writables.getWritable(Writables.java:105)
>     at org.apache.hadoop.hbase.util.Writables.getWritable(Writables.java:75)
>     at org.apache.hadoop.hbase.executor.RegionTransitionData.fromBytes(RegionTransitionData.java:198)
>     at org.apache.hadoop.hbase.zookeeper.ZKAssign.getDataNoWatch(ZKAssign.java:755)
>     at org.apache.hadoop.hbase.master.AssignmentManager$TimeoutMonitor.chore(AssignmentManager.java:1743)
>     at org.apache.hadoop.hbase.Chore.run(Chore.java:66)
>
>
> Any insights would be appreciated.
>
> Thank you.
>

--f46d0442726ce9c51a04bc3c5992--
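
The failure mode discussed in this thread can be reproduced in isolation. The sketch below assumes the event type travels as an enum ordinal and is read back by indexing into `values()`. That assumption is consistent with the `ArrayIndexOutOfBoundsException: 20` raised from `RegionTransitionData.readFields` in the trace, but it is not copied from the HBase source. `OldEventType`, `NewEventType`, and all helper methods are hypothetical stand-ins with a shortened constant list, so the out-of-range index here is 4 rather than 20.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

class EnumOrdinalDemo {
    // Hypothetical event list as a node on the older build would know it.
    enum OldEventType {
        RS_ZK_REGION_CLOSING, RS_ZK_REGION_CLOSED,
        RS_ZK_REGION_OPENING, RS_ZK_REGION_OPENED
    }

    // Hypothetical event list on an upgraded node: one constant appended last.
    enum NewEventType {
        RS_ZK_REGION_CLOSING, RS_ZK_REGION_CLOSED,
        RS_ZK_REGION_OPENING, RS_ZK_REGION_OPENED,
        RS_ZK_REGION_FAILED_OPEN
    }

    // Upgraded writer: stores only the ordinal (assumed wire format).
    static byte[] write(NewEventType type) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(bytes);
            out.writeShort(type.ordinal());
            out.flush();
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Shared helper: pull the serialized ordinal back out of the bytes.
    static int readOrdinal(byte[] data) {
        try {
            return new DataInputStream(new ByteArrayInputStream(data)).readShort();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Older reader: indexes values() directly, so an unknown ordinal throws.
    static OldEventType readWithOldCode(byte[] data) {
        return OldEventType.values()[readOrdinal(data)];
    }

    // Defensive variant: returns null when the ordinal is not known locally.
    static OldEventType readDefensively(byte[] data) {
        int ordinal = readOrdinal(data);
        OldEventType[] known = OldEventType.values();
        return (ordinal >= 0 && ordinal < known.length) ? known[ordinal] : null;
    }

    public static void main(String[] args) {
        byte[] data = write(NewEventType.RS_ZK_REGION_FAILED_OPEN);
        System.out.println("defensive read: " + readDefensively(data));
        try {
            readWithOldCode(data);
        } catch (ArrayIndexOutOfBoundsException e) {
            System.out.println("old reader failed: " + e);
        }
    }
}
```

If this is indeed the mechanism, the bounds check only shows how a reader could fail more gracefully. The mismatch itself would go away once every master and region server runs the same build, which is what the question about remaining 0.90.4 servers is probing.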