Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 3032 invoked from network); 28 Aug 2008 21:38:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 28 Aug 2008 21:38:09 -0000 Received: (qmail 90336 invoked by uid 500); 28 Aug 2008 21:38:04 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 90271 invoked by uid 500); 28 Aug 2008 21:38:04 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 90143 invoked by uid 99); 28 Aug 2008 21:38:03 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Aug 2008 14:38:03 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Aug 2008 21:37:14 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 60668234C1D0 for ; Thu, 28 Aug 2008 14:37:44 -0700 (PDT) Message-ID: <267007734.1219959464393.JavaMail.jira@brutus> Date: Thu, 28 Aug 2008 14:37:44 -0700 (PDT) From: "Allen Wittenauer (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-1869) access times of HDFS files MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12626756#action_12626756 ] Allen Wittenauer commented on HADOOP-1869: ------------------------------------------ Maybe this is a dumb question but how will hadoop archives (htars?) interact with access times? Also, at least to me, an API called touchAC() seems very non-obvious as to its purpose. (esp if I'm doing this on Windows) > access times of HDFS files > -------------------------- > > Key: HADOOP-1869 > URL: https://issues.apache.org/jira/browse/HADOOP-1869 > Project: Hadoop Core > Issue Type: New Feature > Components: dfs > Reporter: dhruba borthakur > Assignee: dhruba borthakur > Fix For: 0.19.0 > > Attachments: accessTime1.patch, accessTime4.patch, accessTime5.patch > > > HDFS should support some type of statistics that allows an administrator to determine when a file was last accessed. > Since HDFS does not have quotas yet, it is likely that users keep on accumulating files in their home directories without much regard to the amount of space they are occupying. This causes memory-related problems with the namenode. > Access times are costly to maintain. AFS does not maintain access times. I thind DCE-DFS does maintain access times with a coarse granularity. > One proposal for HDFS would be to implement something like an "access bit". > 1. This access-bit is set when a file is accessed. If the access bit is already set, then this call does not result in a transaction. > 2. A FileSystem.clearAccessBits() indicates that the access bits of all files need to be cleared. > An administrator can effectively use the above mechanism (maybe a daily cron job) to determine files that are recently used. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.