Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 7363 invoked from network); 4 Oct 2007 22:29:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 4 Oct 2007 22:29:51 -0000 Received: (qmail 88458 invoked by uid 500); 4 Oct 2007 22:29:39 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 88281 invoked by uid 500); 4 Oct 2007 22:29:38 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 88272 invoked by uid 99); 4 Oct 2007 22:29:38 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Oct 2007 15:29:38 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 04 Oct 2007 22:29:41 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id E7A6F714208 for ; Thu, 4 Oct 2007 15:28:50 -0700 (PDT) Message-ID: <10605161.1191536930946.JavaMail.jira@brutus> Date: Thu, 4 Oct 2007 15:28:50 -0700 (PDT) From: "Hairong Kuang (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Updated: (HADOOP-1968) Wildcard input syntax (glob) should support {} In-Reply-To: <23978431.1191015650938.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hairong Kuang updated HADOOP-1968: ---------------------------------- Attachment: curlyGlob.patch This patch allows a glob to use curly brackets as descripbed in the jira. It also makes sure that a file name that contains Java Regex special characters does not get interpreated as an instruction. There is one problem left with globs which is that glob escape does not work. See HADOOP-1995 for more details. I will fix the escape problem once HADOOP-1995 is resolved. > Wildcard input syntax (glob) should support {} > ---------------------------------------------- > > Key: HADOOP-1968 > URL: https://issues.apache.org/jira/browse/HADOOP-1968 > Project: Hadoop > Issue Type: Improvement > Affects Versions: 0.14.1 > Reporter: eric baldeschwieler > Assignee: Hairong Kuang > Fix For: 0.15.0 > > Attachments: curlyGlob.patch > > > We have users who have organized data by day and would like to select several days in a single input specification. For example they would like to be able to say: > '/data/2007{0830,0831,0901}/typeX/' > To input 3 days data into map-reduce (or Pig in this case). > (Also the use of regexp to resolve glob paterns looks like it might introduce some other bugs. I'd appreciate it if someone took another look at the code to see if there are any file characters that could > be interpreted as regexp "instructions"). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.