Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@locus.apache.org Received: (qmail 41244 invoked from network); 13 Feb 2008 03:26:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 13 Feb 2008 03:26:31 -0000 Received: (qmail 51726 invoked by uid 500); 13 Feb 2008 03:26:25 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 51714 invoked by uid 500); 13 Feb 2008 03:26:25 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 51705 invoked by uid 99); 13 Feb 2008 03:26:25 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Feb 2008 19:26:25 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Feb 2008 03:25:47 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id CD9C071406C for ; Tue, 12 Feb 2008 19:26:07 -0800 (PST) Message-ID: <30610908.1202873167812.JavaMail.jira@brutus> Date: Tue, 12 Feb 2008 19:26:07 -0800 (PST) From: "Bryan Duxbury (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-55) [hbase] Improve Master region assignment function MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-55?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12568419#action_12568419 ] Bryan Duxbury commented on HBASE-55: ------------------------------------ I think the tricky part of trying to incorporate update/read rate and memory usage is that this could change very quickly, and if we are making balancing decisions based on this, we could get really bad oscillations in assignments. Moreover, there's really no such thing as "too busy". Either it's less busy than average, and it should take on new regions, or more busy than average, and regions should be taken away. If all of the servers have equal load, but the average is "too high", then all you get is poor performance. At no point does it make sense for a region server to "say no" to an assignment, because in theory the master has decided that assignment is optimal for the known factors. I think that calculating the load factor has to be simple, otherwise we can easily get caught up trying to build a complicated load function that takes into account every factor the region server can offer, but only provides a marginal improvement over simpler functions. Let's try a simple metric and see what happens. If it fails to give us decent distribution, then we can go back to the drawing board. As far as HBASE-70, I see that not so much as a way for us to monitor how much memory is in use as for each region server to work best within the memory it has. Perhaps we also need to be aware of the "swappiness" of a region server (how much of it's cache is being discarded due to memory pressure), but that may be separate. > [hbase] Improve Master region assignment function > ------------------------------------------------- > > Key: HBASE-55 > URL: https://issues.apache.org/jira/browse/HBASE-55 > Project: Hadoop HBase > Issue Type: Improvement > Components: regionserver > Reporter: Bryan Duxbury > Fix For: 0.2.0 > > > We would like the master's region assignment function to take into account more factors when choosing where to assign regions. > > - More advanced accounting of load on regionserver - memory, # requests, etc > - Don't deploy both daughter regions to the same regionserver > - Assign regions where the underlying DFS blocks are hosted if possible > Please add additional ideas in comments as they come up. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.