hadoop-common-dev mailing list archives

From "Hairong Kuang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-1968) Wildcard input syntax (glob) should support {}
Date Thu, 04 Oct 2007 22:28:50 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hairong Kuang updated HADOOP-1968:

    Attachment: curlyGlob.patch

This patch allows a glob to use curly brackets as described in the JIRA issue. It also makes
sure that a file name containing Java regex special characters is not interpreted as a
regex instruction.
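
To illustrate the two behaviours described above, here is a minimal sketch of a glob-to-regex
translation. It is not the code in curlyGlob.patch; the class name GlobSketch and its methods
are hypothetical, and it assumes that only '*', '?' and '{a,b}' are glob operators (character
classes and glob escapes are deliberately left out). Every other character that Java regex
treats specially is quoted so that it matches only itself.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not the implementation in curlyGlob.patch.
public class GlobSketch {
    // Characters that this sketch treats as literal in a glob but that
    // carry special meaning in a Java regular expression.
    private static final String REGEX_SPECIALS = ".^$+()|[]\\";

    // Translate a glob into a Java regex string.
    static String globToRegex(String glob) {
        StringBuilder regex = new StringBuilder();
        int braceDepth = 0;
        for (int i = 0; i < glob.length(); i++) {
            char c = glob.charAt(i);
            switch (c) {
                case '*':
                    regex.append("[^/]*");   // any run of characters within one path component
                    break;
                case '?':
                    regex.append("[^/]");    // exactly one character within a path component
                    break;
                case '{':
                    regex.append("(?:");     // open a non-capturing alternation group
                    braceDepth++;
                    break;
                case '}':
                    if (braceDepth > 0) {
                        regex.append(')');
                        braceDepth--;
                    } else {
                        regex.append("\\}"); // unmatched '}' is kept as a literal
                    }
                    break;
                case ',':
                    // A comma separates alternatives only inside curly brackets.
                    regex.append(braceDepth > 0 ? "|" : ",");
                    break;
                default:
                    if (REGEX_SPECIALS.indexOf(c) >= 0) {
                        regex.append('\\');  // quote regex metacharacters found in file names
                    }
                    regex.append(c);
            }
        }
        if (braceDepth != 0) {
            throw new IllegalArgumentException("Unclosed '{' in glob: " + glob);
        }
        return regex.toString();
    }

    // True if the whole path matches the glob.
    static boolean matches(String glob, String path) {
        return Pattern.matches(globToRegex(glob), path);
    }

    public static void main(String[] args) {
        System.out.println(globToRegex("/data/2007{0830,0831,0901}/typeX"));
        System.out.println(matches("/data/2007{0830,0831,0901}/typeX", "/data/20070831/typeX"));
        System.out.println(matches("a.b+c", "aXb+c"));
    }
}
```

Under these assumptions, a file named a.b+c matches only itself: the '.' and '+' are quoted
rather than being read as "any character" and "one or more".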

One problem with globs remains: glob escaping does not work. See HADOOP-1995 for more
details. I will fix the escape problem once HADOOP-1995 is resolved.

> Wildcard input syntax (glob) should support {}
> ----------------------------------------------
>                 Key: HADOOP-1968
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1968
>             Project: Hadoop
>          Issue Type: Improvement
>    Affects Versions: 0.14.1
>            Reporter: eric baldeschwieler
>            Assignee: Hairong Kuang
>             Fix For: 0.15.0
>         Attachments: curlyGlob.patch
> We have users who have organized data by day and would like to select several days in
> a single input specification.  For example, they would like to be able to say:
> '/data/2007{0830,0831,0901}/typeX/'
> to input three days of data into map-reduce (or Pig in this case).
> (Also, the use of regexp to resolve glob patterns looks like it might introduce some other
> bugs.  I'd appreciate it if someone took another look at the code to see if there are any
> file characters that could be interpreted as regexp "instructions".)
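
The input specification quoted above can be read as shorthand for three separate paths. A
minimal sketch of that reading follows. The class BraceExpansionSketch and its expand method
are hypothetical names used only for illustration, and the sketch assumes curly brackets are
not nested. It is not taken from the Hadoop code base.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of what a {a,b,c} input specification stands for.
public class BraceExpansionSketch {
    // Expand every non-nested {x,y,...} group into the list of concrete paths.
    static List<String> expand(String pattern) {
        List<String> results = new ArrayList<>();
        int open = pattern.indexOf('{');
        if (open < 0) {
            results.add(pattern);            // nothing left to expand
            return results;
        }
        int close = pattern.indexOf('}', open);
        if (close < 0) {
            throw new IllegalArgumentException("Unclosed '{' in: " + pattern);
        }
        String prefix = pattern.substring(0, open);
        String suffix = pattern.substring(close + 1);
        // Substitute each alternative, then expand any later groups recursively.
        for (String alternative : pattern.substring(open + 1, close).split(",", -1)) {
            results.addAll(expand(prefix + alternative + suffix));
        }
        return results;
    }

    public static void main(String[] args) {
        for (String path : expand("/data/2007{0830,0831,0901}/typeX/")) {
            System.out.println(path);
        }
    }
}
```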

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
