Date: Thu, 22 Nov 2012 10:03:00 +0000 (UTC)
From: "Junping Du (JIRA)"
To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Updated] (HDFS-3495) Update Balancer to support new NetworkTopology with NodeGroup

     [ https://issues.apache.org/jira/browse/HDFS-3495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Junping Du updated HDFS-3495:
-----------------------------
    Attachment: HADOOP-9045-v4.patch

Nicholas, I have addressed all your comments in the v4 patch. Many thanks for your review!

> Update Balancer to support new NetworkTopology with NodeGroup
> -------------------------------------------------------------
>
>                 Key: HDFS-3495
>                 URL: https://issues.apache.org/jira/browse/HDFS-3495
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: balancer
>    Affects Versions: 1.1.0, 2.0.2-alpha
>            Reporter: Junping Du
>            Assignee: Junping Du
>         Attachments: HADOOP-8473-Balancer-NodeGroup-aware.patch, HADOOP-9045-v4.patch, HDFS-3495-v2.patch, HDFS-3495-v3.patch
>
>
> Since the Balancer is a Hadoop Tool, it was updated to be directly aware of the four-layer hierarchy instead of creating an alternative Balancer implementation. To accommodate extensibility, a new protected method, doChooseNodesForCustomFaultDomain, is now called from the existing chooseNodes method so that a subclass of the Balancer can customize the balancing algorithm for other failure and locality topologies. An alternative option is to encapsulate the algorithm used for the four-layer hierarchy in a collaborating strategy class.
> The key change introduced to support a four-layer hierarchy was to override the algorithm for choosing source and target pairs for balancing. Unit tests were created to test the new algorithm.
> The algorithm now chooses a source and target node in the same node group as the first priority. The overall balancing policy is: balance first between nodes within the same node group, then within the same rack, and finally off rack. Also, we need to check that no duplicate replicas end up in the same node group after balancing.
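
The pairing priority described above (same node group, then same rack, then off rack) and the duplicate-replica check can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in the attached patches: the class NodeGroupPairingSketch, the Node type, and the methods distanceRank, orderTargets and isGoodTarget are all hypothetical names, and node groups are assumed to be scoped within a rack.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

/** Hypothetical sketch of node-group-aware pairing; not the real Balancer classes. */
final class NodeGroupPairingSketch {

    /** Illustrative datanode location in a four-layer topology: /rack/nodeGroup/host. */
    static final class Node {
        final String name;
        final String rack;
        final String nodeGroup;

        Node(String name, String rack, String nodeGroup) {
            this.name = name;
            this.rack = rack;
            this.nodeGroup = nodeGroup;
        }
    }

    static boolean sameRack(Node a, Node b) {
        return a.rack.equals(b.rack);
    }

    /** Assumption: node group names are only unique within a rack. */
    static boolean sameNodeGroup(Node a, Node b) {
        return sameRack(a, b) && a.nodeGroup.equals(b.nodeGroup);
    }

    /** Lower rank means higher pairing priority. */
    static int distanceRank(Node source, Node target) {
        if (sameNodeGroup(source, target)) {
            return 0; // same node group
        }
        if (sameRack(source, target)) {
            return 1; // same rack, different node group
        }
        return 2; // off rack
    }

    /** Orders candidate targets: same node group, then same rack, then off rack. */
    static List<Node> orderTargets(Node source, List<Node> candidates) {
        List<Node> ordered = new ArrayList<>(candidates);
        ordered.removeIf(n -> n.name.equals(source.name)); // never pair a node with itself
        // List.sort is stable, so candidates with equal rank keep their input order.
        ordered.sort(Comparator.comparingInt((Node t) -> distanceRank(source, t)));
        return ordered;
    }

    /**
     * Returns true if moving a block from source to target leaves no two
     * replicas of that block in the same node group.
     */
    static boolean isGoodTarget(Node source, Node target, List<Node> replicaHolders) {
        for (Node holder : replicaHolders) {
            if (holder.name.equals(source.name)) {
                continue; // this replica is the one being moved away
            }
            if (sameNodeGroup(holder, target)) {
                return false; // target's node group already holds a replica
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Node source = new Node("n1", "r1", "g1");
        List<Node> candidates = Arrays.asList(
                new Node("n2", "r2", "g3"),
                new Node("n3", "r1", "g2"),
                new Node("n4", "r1", "g1"));
        for (Node t : orderTargets(source, candidates)) {
            System.out.println(t.name + " rank=" + distanceRank(source, t));
        }
    }
}
```

Ranking candidates with a single comparator keeps the three priority tiers in one place. A strategy class, as the description suggests as an alternative, could supply a different ranking for other topologies.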