Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@minotaur.apache.org Received: (qmail 80391 invoked from network); 29 Mar 2009 21:48:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 29 Mar 2009 21:48:14 -0000 Received: (qmail 33336 invoked by uid 500); 29 Mar 2009 21:48:13 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 33259 invoked by uid 500); 29 Mar 2009 21:48:13 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 33248 invoked by uid 99); 29 Mar 2009 21:48:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 29 Mar 2009 21:48:13 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 29 Mar 2009 21:48:11 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id A3685234C003 for ; Sun, 29 Mar 2009 14:47:50 -0700 (PDT) Message-ID: <932989300.1238363270660.JavaMail.jira@brutus> Date: Sun, 29 Mar 2009 14:47:50 -0700 (PDT) From: "stack (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-57) [hbase] Master should allocate regions to regionservers based upon data locality and rack awareness MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-57?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12693621#action_12693621 ] stack commented on HBASE-57: ---------------------------- The going direct to local blocks reading is HADOOP-4801. In summary, the payoff short-circuiting the datanode is small, and yet to be seen -- at least to date -- and it seems doubtful that a second route to the data will be opened because of security concerns, etc. Thats my take on the issue (It could change of course). I think that if we only made savings in network traffic, that'd be reason enough to implement locality algorithms. JK makes an interesting point above that we could manufacture hot datanodes if we blindly serve regions from a datanode that hosts all the data but this can happen now since we operate blindly and its only smart use of the locality info that will help damp hot spots. Samuel, if still interested, have you made petition to become a GSOC student using this issue as your project? (Add in some of JKs notes on need to research what happens in a running cluster so know best what to implement). > [hbase] Master should allocate regions to regionservers based upon data locality and rack awareness > --------------------------------------------------------------------------------------------------- > > Key: HBASE-57 > URL: https://issues.apache.org/jira/browse/HBASE-57 > Project: Hadoop HBase > Issue Type: Improvement > Components: master > Affects Versions: 0.2.0 > Reporter: stack > Fix For: 0.20.0 > > > Currently, regions are assigned regionservers based off a basic loading attribute. A factor to include in the assignment calcuation is the location of the region in hdfs; i.e. servers hosting region replicas. If the cluster is such that regionservers are being run on the same nodes as those running hdfs, then ideally the regionserver for a particular region should be running on the same server as hosts a region replica. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.