From: Sagar Shukla
To: "common-user@hadoop.apache.org"
Date: Fri, 8 Jul 2011 16:54:24 +0530
Subject: RE: Difference between DFS Used and Non-DFS Used

Thanks Harsh. My first question still remains unanswered: "Why does it require non-DFS storage?" If it is cache data, then it should get flushed from the system after a certain interval of time.
And if it is useful data, then it should have been part of the DFS used data.

I have a setup in which DFS used is approx. 10 MB, whereas non-DFS used is around 250 GB, which is quite ridiculous.

Thanks,
Sagar

-----Original Message-----
From: Harsh J [mailto:harsh@cloudera.com]
Sent: Friday, July 08, 2011 4:42 PM
To: common-user@hadoop.apache.org
Subject: Re: Difference between DFS Used and Non-DFS Used

It is just for information's sake (cause it can be computed with the
data collected). The space is accounted just to let you know that
there's something being stored on the DataNodes apart from just the
HDFS data, in case you are running out of space.

On Fri, Jul 8, 2011 at 10:18 AM, Sagar Shukla wrote:
> Hi Harsh,
>     Thanks for your reply.
>
> But why does it require non-DFS storage ? And why that space is accounted differently from regular DFS storage ?
>
> Ideally, it should have been part of same storage.
>
> Thanks,
> Sagar
>
> -----Original Message-----
> From: Harsh J [mailto:harsh@cloudera.com]
> Sent: Thursday, July 07, 2011 6:04 PM
> To: common-user@hadoop.apache.org
> Subject: Re: Difference between DFS Used and Non-DFS Used
>
> DFS used is a count of all the space used by the dfs.data.dirs. The
> non-dfs used space is whatever space is occupied beyond that (which
> the DN does not account for).
>
> On Thu, Jul 7, 2011 at 3:29 PM, Sagar Shukla wrote:
>> Hi,
>>       What is the difference between DFS Used and Non-DFS used ?
>>
>> Thanks,
>> Sagar
>>
>> DISCLAIMER
>> ==========
>> This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.
>>
>
> --
> Harsh J

--
Harsh J
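
Harsh's remark that the non-DFS figure "can be computed with the data collected" can be illustrated with a small sketch. It assumes the relation configured capacity = DFS used + non-DFS used + remaining, which is how the figure is commonly described for Hadoop but is not stated in this thread. The function name and all the numbers below are hypothetical, chosen only to resemble the 10 MB / 250 GB situation Sagar describes.

```python
def non_dfs_used(capacity: int, dfs_used: int, remaining: int) -> int:
    """Bytes on the DataNode volumes occupied by anything other than HDFS data.

    Assumption (not from the thread):
        capacity = dfs_used + non_dfs_used + remaining
    """
    value = capacity - dfs_used - remaining
    # Clamp at zero in case the three readings are slightly inconsistent.
    return max(value, 0)


MB = 1024 ** 2
GB = 1024 ** 3

# Hypothetical readings, for illustration only.
capacity = 500 * GB               # total size of the volumes holding dfs.data.dirs
dfs_used = 10 * MB                # space taken by HDFS block data
remaining = 250 * GB - 10 * MB    # free space still available on those volumes

print(non_dfs_used(capacity, dfs_used, remaining) // GB)  # 250
```

Under this assumption, a large non-DFS figure means that other files (logs, temporary data, anything else sharing the disks) occupy the same volumes as the data directories. It is not space that HDFS itself requires.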