Message-ID: <2132323707.1249333874939.JavaMail.jira@brutus>
Date: Mon, 3 Aug 2009 14:11:14 -0700 (PDT)
From: "Jonathan Gray (JIRA)"
To: hbase-dev@hadoop.apache.org
Subject: [jira] Created: (HBASE-1737) Regions unbalanced when adding new node

Regions unbalanced when adding new node
---------------------------------------

                 Key: HBASE-1737
                 URL: https://issues.apache.org/jira/browse/HBASE-1737
             Project: Hadoop HBase
          Issue Type: Bug
    Affects Versions: 0.20.0
            Reporter: Jonathan Gray
            Priority: Blocker
             Fix For: 0.20.0

When adding a new RegionServer to a cluster, the new RS receives some regions, but not enough for the cluster to actually be balanced.

To reproduce, take an RS offline, allow its regions to be reassigned, and then bring it back up. The Master gets into a broken, stuck state in which it continuously outputs a line like this:

{noformat}
2009-08-03 12:54:57,812 DEBUG org.apache.hadoop.hbase.master.RegionManager: Server dn4,60020,1249329081079 will be unloaded for balance. Server load: 341 avg: 318.0, regions can be moved: 55
{noformat}

This line is output every 3 seconds and does not stop until another RS joins or leaves the cluster.

Making this a blocker because when the new RS only gets some regions (in my case, about half as many as it should have), all new regions will be assigned to that RS. This basically destroys any possibility of good load distribution for new data.
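
To check a Master log for the stuck state described above, something like the following sketch may help. It is not part of HBase: the function name find_stuck_servers and the min_repeats threshold are hypothetical, and the regular expression is based only on the single DEBUG line quoted in this report, so it may need adjusting for other versions.

```python
import re
from collections import Counter

# Matches the RegionManager DEBUG line quoted in the report.
# Assumption: the message format is exactly as shown in that one sample.
UNLOAD_RE = re.compile(
    r"Server (?P<server>\S+) will be unloaded for balance\. "
    r"Server load: (?P<load>\d+) avg: (?P<avg>[\d.]+), "
    r"regions can be moved: (?P<movable>\d+)"
)


def find_stuck_servers(lines, min_repeats=3):
    """Return servers reported for unloading with the same load value
    at least min_repeats times.

    min_repeats is an arbitrary illustrative threshold, not an HBase setting.
    """
    seen = Counter()
    for line in lines:
        match = UNLOAD_RE.search(line)
        if match:
            # Key on (server, load): an unchanged load across repeated
            # messages suggests that no regions are actually moving.
            seen[(match["server"], int(match["load"]))] += 1
    return sorted({server for (server, _), n in seen.items() if n >= min_repeats})


# Usage with the line from the report, repeated as it would be every 3 seconds.
sample = (
    "2009-08-03 12:54:57,812 DEBUG org.apache.hadoop.hbase.master.RegionManager: "
    "Server dn4,60020,1249329081079 will be unloaded for balance. "
    "Server load: 341 avg: 318.0, regions can be moved: 55"
)
print(find_stuck_servers([sample] * 5))  # ['dn4,60020,1249329081079']
```

The sketch counts total repeats rather than consecutive ones, which keeps it simple; on a busy cluster a time window around the matches would probably give fewer false positives.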