Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 69229 invoked from network); 6 Jun 2007 17:32:47 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 6 Jun 2007 17:32:47 -0000 Received: (qmail 5568 invoked by uid 500); 6 Jun 2007 17:32:50 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 5547 invoked by uid 500); 6 Jun 2007 17:32:50 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 5538 invoked by uid 99); 6 Jun 2007 17:32:50 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Jun 2007 10:32:50 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Jun 2007 10:32:46 -0700 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 12C5971418E for ; Wed, 6 Jun 2007 10:32:26 -0700 (PDT) Message-ID: <10936247.1181151146073.JavaMail.jira@brutus> Date: Wed, 6 Jun 2007 10:32:26 -0700 (PDT) From: "Raghu Angadi (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-1463) dfs should report total size of all the space that dfs is using In-Reply-To: <18116733.1181066965892.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502000 ] Raghu Angadi commented on HADOOP-1463: -------------------------------------- The meaning of "Reserved" that current implements is : "Space that Datanode should try to keep *free* for immediate or future use by either Datanode or some other application". In that sense I think the calculates well. Note that this calculation does not require costly du. After chatting with Hairong, I think Koji's impression of "Reserved" is : "Disk space that Datanode avoids in its calculations whether it is used or free". Eg. if there is a partition of 100 GB, 50% is reserved, and DFS occupies 40 GB, 25GB is "available", then "remaining" should be 10GB. This requires equivalent of 'du' for Datanode's data. With the previous interpretation remaining will be zero. If we want the latter, then we need either do 'du' or maintain disk used, to be sent with every heartbeat. > dfs should report total size of all the space that dfs is using > --------------------------------------------------------------- > > Key: HADOOP-1463 > URL: https://issues.apache.org/jira/browse/HADOOP-1463 > Project: Hadoop > Issue Type: Improvement > Components: dfs > Affects Versions: 0.12.3 > Reporter: Hairong Kuang > Fix For: 0.14.0 > > > Currently namenode reports two statistics back to the client: > 1. The total capacity of dfs. This is a sum of all datanode's capacities, each of which is calculated by datanode summing all data directories disk space. > 2. The total remaining space of dfs. This is a sum of all datanodes's remaining space. Each datanode's remaining space is calculated by using the following formula: remaining space = unused space - capacity*unusableDiskPercentage - reserved space. So the remaining space shows how much space that the dfs can still use, but it does not show the size of unused space. > Each dfs client caculates the total dfs used space by substracting remaining space from the total capacity. So the used space does not accurately shows the space that dfs is using. However it is a very important number that dfs should provide. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.