hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Raghu Angadi (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-2151) FileSyste.globPaths does not validate the return list of Paths
Date Tue, 06 Nov 2007 00:08:50 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12540306

Raghu Angadi commented on HADOOP-2151:

This is because globPaths decides invoke {{listPaths}} on directories only if there is a glob
specified for that 'component' of path.

I think we should change it to use {{listPaths}} at every component if there is a glob specified
anywhere to start with. Otherwise just return the path. Minimizing {{listPaths}} is not required
when a glob is specified since the user is already willing to pay the cost. 

> FileSyste.globPaths does not validate the return list of Paths
> --------------------------------------------------------------
>                 Key: HADOOP-2151
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2151
>             Project: Hadoop
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.14.3, 0.15.0
>            Reporter: lohit vijayarenu
>             Fix For: 0.16.0
> FileSystem.globPaths does not validate the return list of Paths.
> Here is an example. 
> Consider a directory structure like
> /user/foo/DIR1/FILE1
> /user/foo/DIR2
> now if we pass an input path like "/user/foo/*/FILE1" to FileSystem.globPaths()
> It returns 2 entries as shown below
> /user/foo/DIR1/FILE1
> /user/foo/DIR2/FILE1
> Should globPaths validate this and return only valid Paths? This behavior was caught
in FileSystem.validateInput() where an IOException is thrown while processing such a directory

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message