Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 010C54146 for ; Thu, 7 Jul 2011 06:40:06 +0000 (UTC) Received: (qmail 97622 invoked by uid 500); 7 Jul 2011 06:40:01 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 96196 invoked by uid 500); 7 Jul 2011 06:39:46 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 96076 invoked by uid 99); 7 Jul 2011 06:39:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 06:39:41 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 06:39:39 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 007504A1C1 for ; Thu, 7 Jul 2011 06:39:19 +0000 (UTC) Date: Thu, 7 Jul 2011 06:39:18 +0000 (UTC) From: "Jieshan Bean (JIRA)" To: issues@hbase.apache.org Message-ID: <801203025.6303.1310020758998.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <623471579.39880.1308967967394.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HBASE-4033) The shutdown RegionServer could be added to AssignmentManager.servers again MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-4033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061064#comment-13061064 ] Jieshan Bean commented on HBASE-4033: ------------------------------------- I have not make a same version patch for trunk...Sorry. I'll make it later...:) > The shutdown RegionServer could be added to AssignmentManager.servers again > --------------------------------------------------------------------------- > > Key: HBASE-4033 > URL: https://issues.apache.org/jira/browse/HBASE-4033 > Project: HBase > Issue Type: Bug > Components: master > Affects Versions: 0.90.3 > Reporter: Jieshan Bean > Fix For: 0.90.4 > > Attachments: A_hbase-root-master-167-6-1-11.rar, HBASE-4033-90-V1.patch, HBASE-4033-90-V2.patch, HBASE-4033-trunk-V1.patch, analysis.gif, test-report.txt > > > The folling steps can easily recreate the problem: > 1. There's thousands of regions in the cluster. > 2. Stop the cluster. > 3. Start the cluster. Killing one regionserver while the regions were opening. Restarted it after 10 seconds. > The shutted regionserver will appear in the AssignmentManager.servers list again. > For example: > Issue 1: > 2011-06-23 14:14:30,775 DEBUG org.apache.hadoop.hbase.master.LoadBalancer: Server information: 167-6-1-12,20020,1308803390123=2220, 167-6-1-13,20020,1308803391742=2374, 167-6-1-11,20020,1308803386333=2205, 167-6-1-13,20020,1308803514394=2183 > Two regionservers(One of it had aborted) had the same hostname but different startcode: > 167-6-1-13,20020,1308803391742=2374 > 167-6-1-13,20020,1308803514394=2183 > Issue 2: > (1).The Rs 167-6-1-11,20020,1308105402003 finished shutdown at "10:46:37,774": > 10:46:37,774 INFO org.apache.hadoop.hbase.master.handler.ServerShutdownHandler: Finished processing of shutdown of 167-6-1-11,20020,1308105402003 > (2).Overwriting happened, it seemed the RS was still exist in the set of AssignmentManager#regions: > 10:45:55,081 WARN org.apache.hadoop.hbase.master.AssignmentManager: Overwriting 612342de1fe4733f72299d70addb6d11 on serverName=167-6-1-11,20020,1308105402003, load=(requests=0, regions=0, usedHeap=0, maxHeap=0) > (3).Region was assigned to this dead RS again at "10:50:20,671": > 10:50:20,671 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: Assigning region Jeason10,08058613800000030,1308032774777.612342de1fe4733f72299d70addb6d11. to 167-6-1-11,20020,1308105402003 -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira