hadoop-pig-dev mailing list archives

From "Richard Ding (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-1071) Support comma separated file/directory names in load statements
Date Wed, 04 Nov 2009 01:11:32 GMT
Support comma separated file/directory names in load statements
---------------------------------------------------------------

                 Key: PIG-1071
                 URL: https://issues.apache.org/jira/browse/PIG-1071
             Project: Pig
          Issue Type: New Feature
            Reporter: Richard Ding


Currently Pig Latin supports the following LOAD syntax:

{code}
LOAD 'data' [USING loader function] [AS schema];      
{code}

where 'data' is the name of a file or directory, which may be specified with Hadoop-supported
globbing syntax. This name is passed to the loader function.

This feature supports loaders that can load multiple files from different directories
by allowing users to pass the file names in a single comma-separated string.

For example, these will be valid load statements:

{code}
LOAD '/usr/pig/test1/a,/usr/pig/test2/b' USING someloader();
{code}

and 

{code}
LOAD '/usr/pig/test1/{a,c},/usr/pig/test2/b' USING someloader();
{code}

This comma-separated string is passed to the loader.
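
The second example shows why a loader cannot simply split the string on every comma: the comma inside {a,c} belongs to a glob pattern. Below is a minimal sketch of how a loader might separate the paths, assuming that commas inside curly braces are part of a glob and only top-level commas act as separators. The class and method names (LocationSplitter, splitLocations) are hypothetical and are not part of Pig's API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Pig: splits a load location string
// into individual paths at commas that are outside any {...} glob group.
class LocationSplitter {

    static List<String> splitLocations(String location) {
        List<String> paths = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int depth = 0; // current nesting level of curly braces

        for (int i = 0; i < location.length(); i++) {
            char ch = location.charAt(i);
            if (ch == '{') {
                depth++;
            } else if (ch == '}' && depth > 0) {
                depth--;
            }
            if (ch == ',' && depth == 0) {
                // Top-level comma: the current path is complete.
                paths.add(current.toString());
                current.setLength(0);
            } else {
                // Any other character, including a comma inside braces,
                // stays part of the current path.
                current.append(ch);
            }
        }
        paths.add(current.toString());
        return paths;
    }

    public static void main(String[] args) {
        // Prints each path from the second example on its own line.
        for (String path : splitLocations("/usr/pig/test1/{a,c},/usr/pig/test2/b")) {
            System.out.println(path);
        }
    }
}
```

Under these assumptions the first example yields the two paths /usr/pig/test1/a and /usr/pig/test2/b, and the second yields /usr/pig/test1/{a,c} and /usr/pig/test2/b, leaving the glob for the file system to expand.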

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

