Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 68802 invoked from network); 16 Dec 2010 00:29:24 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 16 Dec 2010 00:29:24 -0000 Received: (qmail 78637 invoked by uid 500); 16 Dec 2010 00:29:24 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 78620 invoked by uid 500); 16 Dec 2010 00:29:24 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 78612 invoked by uid 99); 16 Dec 2010 00:29:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Dec 2010 00:29:24 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Dec 2010 00:29:22 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oBG0T04V024320 for ; Thu, 16 Dec 2010 00:29:00 GMT Message-ID: <11260567.148871292459340687.JavaMail.jira@thor> Date: Wed, 15 Dec 2010 19:29:00 -0500 (EST) From: "stack (JIRA)" To: issues@hbase.apache.org Subject: [jira] Updated: (HBASE-3365) EOFE contacting crashed RS causes Master abort In-Reply-To: <33246877.148841292458860676.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-3365: ------------------------- Attachment: 3365.txt Small patch that adds EOFE as possible exception sending close region. Will just apply. > EOFE contacting crashed RS causes Master abort > ---------------------------------------------- > > Key: HBASE-3365 > URL: https://issues.apache.org/jira/browse/HBASE-3365 > Project: HBase > Issue Type: Bug > Reporter: stack > Assignee: stack > Fix For: 0.90.0 > > Attachments: 3365.txt > > > Just got this testing: > {code} > 2010-12-16 00:05:02,863 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: Assigning region TestTable,0071897074,1292373519828.8cec43d5df41ea830b08180f688f2819. to sv2borg181,60020,1292457487454 > 2010-12-16 00:05:02,867 FATAL org.apache.hadoop.hbase.master.HMaster: Remote unexpected exception > java.io.IOException: Call to sv2borg185/10.20.20.185:60020 failed on local exception: java.io.EOFException > at org.apache.hadoop.hbase.ipc.HBaseClient.wrapException(HBaseClient.java:788) > at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:757) > at org.apache.hadoop.hbase.ipc.HBaseRPC$Invoker.invoke(HBaseRPC.java:257) > at $Proxy7.closeRegion(Unknown Source) > at org.apache.hadoop.hbase.master.ServerManager.sendRegionClose(ServerManager.java:589) > at org.apache.hadoop.hbase.master.AssignmentManager.unassign(AssignmentManager.java:1085) > at org.apache.hadoop.hbase.master.AssignmentManager.unassign(AssignmentManager.java:1032) > at org.apache.hadoop.hbase.master.AssignmentManager.balance(AssignmentManager.java:1791) > at org.apache.hadoop.hbase.master.HMaster.balance(HMaster.java:688) > at org.apache.hadoop.hbase.master.HMaster$1.chore(HMaster.java:579) > at org.apache.hadoop.hbase.Chore.run(Chore.java:66) > Caused by: java.io.EOFException > at java.io.DataInputStream.readInt(DataInputStream.java:375) > at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.receiveResponse(HBaseClient.java:521) > at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:459) > 2010-12-16 00:05:02,868 INFO org.apache.hadoop.hbase.master.HMaster: Aborting > {code} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.