Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A0CD1173E5 for ; Tue, 23 Jun 2015 04:00:09 +0000 (UTC) Received: (qmail 97739 invoked by uid 500); 23 Jun 2015 04:00:09 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 97693 invoked by uid 500); 23 Jun 2015 04:00:09 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 97681 invoked by uid 99); 23 Jun 2015 04:00:09 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Jun 2015 04:00:09 +0000 Date: Tue, 23 Jun 2015 04:00:09 +0000 (UTC) From: "Ming Ma (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HDFS-1431) Balancer should work with the logic of BlockPlacementPolicy MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597133#comment-14597133 ] Ming Ma commented on HDFS-1431: ------------------------------- When I met with [~andrew.wang], [~ctrezzo], [~atm], [~cmccabe] the other day, we had brief discussion about balancer. To make balancer use BlockPlacementPolicy, alternatively we can run balancer inside namenode. Namenode already has the necessary information. It needs to provide balancer throttling with some refactoring. But overall it seems it shouldn't create much overhead on namenode. It will be great to heard from others about this approach on potential issues such as scale and performance. > Balancer should work with the logic of BlockPlacementPolicy > ----------------------------------------------------------- > > Key: HDFS-1431 > URL: https://issues.apache.org/jira/browse/HDFS-1431 > Project: Hadoop HDFS > Issue Type: Improvement > Components: balancer & mover > Affects Versions: 0.22.0 > Reporter: Scott Chen > Assignee: Scott Chen > Attachments: HDFS-1431.txt > > > Currently Balancer does not obtain information from BlockPlacementPolicy so it can transfer the blocks without checking with BlockPlacementPolicy. > This causes the policy break after balancing the cluster. > There are some new policies proposed in HDFS-1094 and MAPREDUCE-1831 in which the block placement follows some pattern. > The pattern can be broken by Balancer. > I propose that we add the following method in BlockPlacementPolicy: > {code} > abstract public boolean canBeMoved(String fileName, Block block, > DatanodeInfo source, DatanodeInfo destination); > {code} > And make Balancer use it in > {code} > private boolean isGoodBlockCandidate(Source source, > BalancerDatanode target, BalancerBlock block) > {code} > What do you think? -- This message was sent by Atlassian JIRA (v6.3.4#6332)