Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 54F70CC68 for ; Thu, 26 Apr 2012 23:17:10 +0000 (UTC) Received: (qmail 47460 invoked by uid 500); 26 Apr 2012 23:17:10 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 47312 invoked by uid 500); 26 Apr 2012 23:17:10 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 47304 invoked by uid 99); 26 Apr 2012 23:17:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Apr 2012 23:17:10 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Apr 2012 23:17:08 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id DF8004226A6 for ; Thu, 26 Apr 2012 23:16:48 +0000 (UTC) Date: Thu, 26 Apr 2012 23:16:48 +0000 (UTC) From: "Kihwal Lee (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1319688830.851.1335482208916.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <974944667.37015.1331165576997.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HDFS-3061) Cached directory size in INodeDirectory can get permantently out of sync with computed size, causing quota issues MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13263239#comment-13263239 ] Kihwal Lee commented on HDFS-3061: ---------------------------------- One example: {noformat} 012-04-18 00:06:04,246 WARN org.apache.hadoop.hdfs.server.namenode.NameNode: Inconsistent diskspace for directory xxxxx. Cached: 7877252111770 Computed: 4159086535 {noformat} The delta grows everyday. > Cached directory size in INodeDirectory can get permantently out of sync with computed size, causing quota issues > ----------------------------------------------------------------------------------------------------------------- > > Key: HDFS-3061 > URL: https://issues.apache.org/jira/browse/HDFS-3061 > Project: Hadoop HDFS > Issue Type: Bug > Components: name-node > Affects Versions: 0.20.203.0 > Environment: 0.20.203 with HDFS-1377 and HDFS-2053 patches applied > Reporter: Alex Holmes > Attachments: QuotaTestSimple.java > > > It appears that there's a condition under which a HDFS directory with a space quota set can get to a point where the cached size for the directory can permanently differ from the computed value. When this happens the following command: > {code} > hadoop fs -count -q /tmp/quota-test > {code} > results in the following output in the NameNode logs: > {code} > WARN org.apache.hadoop.hdfs.server.namenode.NameNode: Inconsistent diskspace for directory quota-test. Cached: 6000 Computed: 6072 > {code} > I've observed both transient and persistent instances of this happening. In the transient instances this warning goes away, but in the persistent instances every invocation of the {{fs -count -q}} command yields the above warning. > I've seen instances where the actual disk usage of a directory is 25% of the cached value in INodeDirectory, which creates problems since the quota code uses this cached value to determine whether block write requests are permitted. > This isn't easy to reproduce - I am able to (inconsistently) get HDFS into this state with a simple program which: > # Writes files into HDFS > # When a DSQuotaExceededException is encountered removes all files created in step 1 > # Repeat step 1 > I'm going to try and come up with a more repeatable test case to reproduce this issue. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira