Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 92125 invoked from network); 17 May 2010 21:00:12 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 17 May 2010 21:00:12 -0000 Received: (qmail 50750 invoked by uid 500); 17 May 2010 21:00:12 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 50720 invoked by uid 500); 17 May 2010 21:00:12 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 50712 invoked by uid 99); 17 May 2010 21:00:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 17 May 2010 21:00:12 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 17 May 2010 21:00:09 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o4HKxmJW028343 for ; Mon, 17 May 2010 20:59:48 GMT Message-ID: <23633225.91841274129988291.JavaMail.jira@thor> Date: Mon, 17 May 2010 16:59:48 -0400 (EDT) From: "Scott Chen (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-1143) Implement Background deletion In-Reply-To: <18585901.9311273604022647.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-1143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12868404#action_12868404 ] Scott Chen commented on HDFS-1143: ---------------------------------- Had some offline discussion with Dhruba. The incrDeletedFileCound() is used for updating the metrics only. It is OK that the clients don't wait the metrics to be updated. > Implement Background deletion > ----------------------------- > > Key: HDFS-1143 > URL: https://issues.apache.org/jira/browse/HDFS-1143 > Project: Hadoop HDFS > Issue Type: Improvement > Components: name-node > Affects Versions: 0.22.0 > Reporter: Dmytro Molkov > Assignee: Scott Chen > Fix For: 0.22.0 > > Attachments: HDFS-1143.txt > > > Right now if you try to delete massive number of files from the namenode it will freeze (sometimes for minutes). Most of the time is spent going through the blocks map and invalidating all the blocks. > This can probably be improved by having a background GC process. The deletion will basically just remove the inode being deleted and then give the subtree that was just deleted to the background thread running cleanup. > This way the namenode becomes available for the clients soon after deletion, and all the heavy operations are done in the background. > Thoughts? -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.