hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Doug Cutting (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-2566) need FileSystem#globStatus method
Date Wed, 16 Jan 2008 00:25:34 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559299#action_12559299
] 

Doug Cutting commented on HADOOP-2566:
--------------------------------------

A few comments:
- should stat2paths be a public method on FileSystem?  I'd prefer it were either private or
perhaps on FileUtil.
- globPaths() isn't deprecated.  Do we think we'll keep this, or should it be deprecated?
 It is handy in some cases, but, on the other hand, we'd like to force folks to examine their
uses of it, since in most cases performance will become abysmal once the FileStatus cache
is removed, and we don't want to surprise folks with that.  Thoughts?


> need FileSystem#globStatus method
> ---------------------------------
>
>                 Key: HADOOP-2566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2566
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Doug Cutting
>            Assignee: Hairong Kuang
>             Fix For: 0.16.0
>
>         Attachments: globStatus.patch
>
>
> To remove the cache of FileStatus in DFSPath (HADOOP-2565) without hurting performance,
we must use file enumeration APIs that return FileStatus[] rather than Path[].  Currently
we have FileSystem#globPaths(), but that method should be deprecated and replaced with a FileSystem#globStatus().
> We need to deprecate FileSystem#globPaths() in 0.16 in order to remove the cache in 0.17.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message