hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-14235) S3A Path does not understand colon (:) when globbing
Date Fri, 24 Mar 2017 23:54:42 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-14235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15941346#comment-15941346

ASF GitHub Bot commented on HADOOP-14235:

GitHub user kazuyukitanimura opened a pull request:


    HADOOP-14235. S3A Path does not understand colon (:) when globbing

    Hi @steveloughran (not sure who else I need to involve here)
    I explained the issue at https://issues.apache.org/jira/browse/HADOOP-14235
    This pull request fixes the issue and does not break other things as far as I know. (I
also ran the unit tests).
    Probably, #204 should also fix this issue. This pull request is for a short-term solution
in case anyone is interested.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/bloomreach/hadoop HADOOP-14235

Alternatively you can review and apply these changes as the patch at:


To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #206
commit ff37d08ed314d7b2e9b7d8aff648e38e21fceacb
Author: kazu <kazu@bloomreach.com>
Date:   2017-03-24T23:36:01Z

    HADOOP-14235. S3A Path does not understand colon (:) when globbing


> S3A Path does not understand colon (:) when globbing
> ----------------------------------------------------
>                 Key: HADOOP-14235
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14235
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs/s3
>    Affects Versions: 2.8.0, 3.0.0-alpha1, 3.0.0-alpha2, 2.8.1
>         Environment: EC2, AWS
>            Reporter: Kazuyuki Tanimura
> S3 paths, colons ":" are valid character in S3 paths. However, the Java URI class, which
is used in the Path class, does not allow it.
> This becomes a problem particularly when we are globbing S3 paths. The globber thinks
paths with colons are invalid paths and throws URISyntaxException.
> The reason is we are sharing Globber.java with all other Fs. Some of the rules for regular
Fs are not applicable to S3 just like this colon as an example.
> Same issue is reported here https://issues.apache.org/jira/browse/SPARK-20061
> The good news is I have a one line fix that I am about to send a pull request.
> However, for a right fix, we should separate the S3 globber from the Globber.java as
proposed at https://issues.apache.org/jira/browse/HADOOP-13371

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org

View raw message