Date: Tue, 10 Jan 2017 15:12:58 +0000 (UTC)
From: "Kihwal Lee (JIRA)"
To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Commented] (HDFS-7967) Reduce the performance impact of the balancer

[ https://issues.apache.org/jira/browse/HDFS-7967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15815227#comment-15815227 ]

Kihwal Lee commented on HDFS-7967:
----------------------------------

I have reviewed the patch and am fine with it.
We have used a variant of this for over a year, and this is the latest improved version. We can commit this to branch-2 and 2.8 now, but then we will likely forget about the remaining work and move on. So I think we need to discuss what we are going to do for trunk. [~daryn], would you share your thoughts and concerns on the state of trunk and possible solutions?

> Reduce the performance impact of the balancer
> ---------------------------------------------
>
>                 Key: HDFS-7967
>                 URL: https://issues.apache.org/jira/browse/HDFS-7967
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: namenode
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>         Attachments: HDFS-7967-branch-2.8.patch, HDFS-7967-branch-2.patch, HDFS-7967.branch-2-1.patch, HDFS-7967.branch-2.001.patch, HDFS-7967.branch-2.002.patch, HDFS-7967.branch-2.8-1.patch, HDFS-7967.branch-2.8.001.patch, HDFS-7967.branch-2.8.002.patch
>
>
> The balancer needs to query for blocks to move from overly full DNs. The block lookup is extremely inefficient. An iterator of the node's blocks is created from the iterators of its storages' blocks. A random number is chosen corresponding to how many blocks will be skipped via the iterator. Each skip requires costly scanning of triplets.
> The current design also only considers node imbalances while ignoring imbalances within the node's storages. A more efficient and intelligent design may eliminate the costly skipping of blocks via round-robin selection of blocks from the storages based on remaining capacity.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
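
For readers following the issue description, the round-robin idea it proposes might look roughly like the sketch below. This is only an illustration under stated assumptions, not the code in the attached patches and not the HDFS BlockManager API: the `Storage` class, the `pickBlocks` method, and the "fullest storage first" ordering are all hypothetical. The point it shows is that each storage's iterator only advances one block at a time, so nothing is skipped over.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;

/** Hypothetical sketch of round-robin block selection; not HDFS code. */
public class RoundRobinBlockPicker {

    /** Hypothetical stand-in for one storage (volume) on a DataNode. */
    static final class Storage {
        final String id;
        final long remainingBytes;
        final List<String> blockIds;

        Storage(String id, long remainingBytes, List<String> blockIds) {
            this.id = id;
            this.remainingBytes = remainingBytes;
            this.blockIds = blockIds;
        }
    }

    /**
     * Picks up to maxBlocks block ids. Storages are visited in order of
     * least remaining capacity first (an assumed policy), and one block is
     * taken from each storage per pass until enough blocks are collected
     * or every storage is exhausted.
     */
    static List<String> pickBlocks(List<Storage> storages, int maxBlocks) {
        List<Storage> ordered = new ArrayList<>(storages);
        ordered.sort(Comparator.comparingLong((Storage s) -> s.remainingBytes));

        // One forward-only cursor per storage; no random skipping.
        List<Iterator<String>> cursors = new ArrayList<>();
        for (Storage s : ordered) {
            cursors.add(s.blockIds.iterator());
        }

        List<String> picked = new ArrayList<>();
        boolean progressed = true;
        while (picked.size() < maxBlocks && progressed) {
            progressed = false;
            for (Iterator<String> cursor : cursors) {
                if (picked.size() >= maxBlocks) {
                    break;
                }
                if (cursor.hasNext()) {
                    picked.add(cursor.next());
                    progressed = true;
                }
            }
        }
        return picked;
    }

    public static void main(String[] args) {
        List<Storage> storages = Arrays.asList(
            new Storage("disk-a", 500L, Arrays.asList("a1", "a2", "a3")),
            new Storage("disk-b", 100L, Arrays.asList("b1", "b2")),
            new Storage("disk-c", 300L, Arrays.asList("c1")));
        System.out.println(pickBlocks(storages, 5));
    }
}
```

With the made-up data in `main`, the fullest storage (`disk-b`) is visited first on every pass. A real implementation would also need to handle locking and concurrent modification of the block lists, which this sketch ignores.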