From: "Koji Noguchi (JIRA)"
To: hadoop-dev@lucene.apache.org
Date: Wed, 11 Jul 2007 14:29:04 -0700 (PDT)
Subject: [jira] Commented: (HADOOP-1463) dfs should report total size of all the space that dfs is using
Message-ID: <10776086.1184189344633.JavaMail.jira@brutus>
In-Reply-To: <18116733.1181066965892.JavaMail.jira@brutus>

[ https://issues.apache.org/jira/browse/HADOOP-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12511871 ]

Koji Noguchi commented on HADOOP-1463:
--------------------------------------

> From my understanding dfs.datanode.du.pct and dfs.datanode.du.reserved are two different ways of specifying reserved space.
>

I used to think the same. But there shouldn't be two config variables that serve the same purpose.

So, assuming "dfs.datanode.du.reserved" is the one for "space reserved for non-dfs usage *whether it is used or unused*", I'd want

{noformat}
MIN( Total_Capacity - cur_space_used_by_Datanode - dfs.datanode.du.reserved,
     cur_disk_available)
{noformat}

I'm not sure where "dfs.datanode.du.pct" should fit. Maybe

{noformat}
MIN( Total_Capacity - cur_space_used_by_Datanode - dfs.datanode.du.reserved,
     cur_disk_available) * dfs.datanode.du.pct
{noformat}

> dfs should report total size of all the space that dfs is using
> ---------------------------------------------------------------
>
>                 Key: HADOOP-1463
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1463
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: dfs
>   Affects Versions: 0.12.3
>            Reporter: Hairong Kuang
>            Assignee: Hairong Kuang
>             Fix For: 0.14.0
>
>
> Currently namenode reports two statistics back to the client:
> 1. The total capacity of dfs. This is the sum of all datanodes' capacities, each of which the datanode calculates by summing the disk space of all its data directories.
> 2. The total remaining space of dfs. This is the sum of all datanodes' remaining space. Each datanode's remaining space is calculated with the following formula: remaining space = unused space - capacity*unusableDiskPercentage - reserved space. So the remaining space shows how much space the dfs can still use, but it does not show the size of the unused space.
> Each dfs client calculates the total dfs used space by subtracting the remaining space from the total capacity. So the used space does not accurately show the space that dfs is using. However, it is a very important number that dfs should provide.
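
To make the difference between the formulas in this thread concrete, here is a small sketch in Python. It is illustrative only and is not Hadoop code: the function and parameter names and the sample numbers are hypothetical, and it assumes that unusableDiskPercentage and dfs.datanode.du.pct are both expressed as fractions between 0 and 1, with dfs.datanode.du.pct applied as a multiplier as in the "Maybe" variant above.

```python
def remaining_current(unused, capacity, unusable_disk_fraction, reserved):
    """Formula quoted in the issue description:
    remaining = unused - capacity * unusableDiskPercentage - reserved.
    """
    return unused - capacity * unusable_disk_fraction - reserved


def remaining_proposed(capacity, dfs_used, reserved, disk_available,
                       usable_fraction=1.0):
    """Formula suggested in the comment:
    MIN(capacity - dfs_used - reserved, disk_available) * du.pct.
    usable_fraction stands in for dfs.datanode.du.pct (assumed a fraction).
    """
    return min(capacity - dfs_used - reserved, disk_available) * usable_fraction


# Hypothetical datanode, sizes in GB: 1000 total, 300 used by dfs,
# 100 used by non-dfs files, so 600 is still free on disk.
capacity, dfs_used, reserved, available = 1000, 300, 200, 600

# The quoted formula subtracts the full reservation from the free space,
# even though 100 GB of that reservation is already consumed by non-dfs data.
print(remaining_current(available, capacity, 0.0, reserved))

# The suggested formula treats the reservation as covering used and unused
# non-dfs space alike, and never reports more than what is free on disk.
print(remaining_proposed(capacity, dfs_used, reserved, available))
```

With these sample numbers the two formulas disagree because the quoted one counts the non-dfs data twice: once in the reduced free space and again in the reservation. If non-dfs usage grows past the reservation, the `disk_available` term becomes the smaller argument to `min` and caps the result.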