Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 7F154200B9F for ; Tue, 11 Oct 2016 15:07:23 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 7DC82160AF4; Tue, 11 Oct 2016 13:07:23 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C2F48160AE6 for ; Tue, 11 Oct 2016 15:07:22 +0200 (CEST) Received: (qmail 1635 invoked by uid 500); 11 Oct 2016 13:07:21 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 1596 invoked by uid 99); 11 Oct 2016 13:07:21 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Oct 2016 13:07:21 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 457572C4C73 for ; Tue, 11 Oct 2016 13:07:21 +0000 (UTC) Date: Tue, 11 Oct 2016 13:07:21 +0000 (UTC) From: "Kihwal Lee (JIRA)" To: hdfs-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Resolved] (HDFS-8041) Consider remaining space during block blockplacement if dfs space is highly utilized MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 11 Oct 2016 13:07:23 -0000 [ https://issues.apache.org/jira/browse/HDFS-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kihwal Lee resolved HDFS-8041. ------------------------------ Resolution: Won't Fix While this jira makes balancing more aggressive when free space is low, there are corner cases where it can perform poorly. AvailableSpaceBlockPlacementPolicy might be a better tool. Closing, won't fix. > Consider remaining space during block blockplacement if dfs space is highly utilized > ------------------------------------------------------------------------------------ > > Key: HDFS-8041 > URL: https://issues.apache.org/jira/browse/HDFS-8041 > Project: Hadoop HDFS > Issue Type: Improvement > Reporter: Kihwal Lee > Assignee: Kihwal Lee > Labels: BlockPlacementPolicy > Attachments: HDFS-8041.v1.patch, HDFS-8041.v2.patch, HDFS-8041.v3.patch, HDFS-8041.v4.patch > > > This feature is helpful in avoiding smaller nodes (i.e. heterogeneous environment) getting constantly being full when the overall space utilization is over a certain threshold. When the utilization is low, balancer can keep up, but once the average per-node byte goes over the capacity of the smaller nodes, they get full so quickly even after perfect balance. > This jira proposes an improvement that can be optionally enabled in order to slow down the rate of space usage growth of smaller nodes if the overall storage utilization is over a configured threshold. It will not replace balancer, rather will help balancer keep up. Also, the primary replica placement will not be affected. Only the replicas typically placed in a remote rack will be subject to this check. > The appropriate threshold is cluster configuration specific. There is no generally good value to set, thus it is disabled by default. We have seen cases where the threshold of 85% - 90% would help. Figuring when {{totalSpaceUsed / numNodes}} becomes close to the capacity of a smaller node is helpful in determining the threshold. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org