hadoop-pig-dev mailing list archives

From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-789) coupling load and store in script no longer works
Date Wed, 29 Apr 2009 01:28:30 GMT
coupling load and store in script no longer works
-------------------------------------------------

                 Key: PIG-789
                 URL: https://issues.apache.org/jira/browse/PIG-789
             Project: Pig
          Issue Type: Bug
          Components: impl
    Affects Versions: 0.3.0
            Reporter: Alan Gates


Many users' Pig scripts do something like this:

a = load '/user/pig/tests/data/singlefile/studenttab10k' as (name, age, gpa);
c = filter a by age > 500;
e = group c by (name, age);
f = foreach e generate group, COUNT($1);
store f into 'bla';
f1 = load 'bla';
g = order f1 by $1;
dump g;

Since the multi-query phase 2 patch was included, this no longer appears to work.  You get
an error:

2009-04-28 18:24:50,776 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2100: hdfs://wilbur11.labs.corp.sp1.yahoo.com/user/gates/bla does not exist.

We shouldn't be checking for bla's existence here, because the store statement earlier in
the same script will create it before the load runs.
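
To illustrate the behaviour being asked for, here is a minimal sketch in Python. It is not Pig's actual code: the function name missing_inputs and the (op, path) tuple representation of a script are hypothetical, chosen only for illustration. The idea is that a load path should be reported as missing only if it neither exists already nor is written by an earlier store in the same script.

```python
def missing_inputs(statements, existing_paths):
    """Return load paths that neither exist nor are stored earlier in the script.

    statements: list of (op, path) tuples in script order; op is "load" or "store".
    existing_paths: set of paths already present on the filesystem.
    Both are hypothetical stand-ins for a parsed script and a filesystem check.
    """
    produced = set()  # paths written by store statements seen so far
    missing = []
    for op, path in statements:
        if op == "store":
            produced.add(path)
        elif op == "load" and path not in existing_paths and path not in produced:
            missing.append(path)
    return missing


# The script from this report, reduced to its load/store statements.
script = [
    ("load", "/user/pig/tests/data/singlefile/studenttab10k"),
    ("store", "bla"),
    ("load", "bla"),
]
existing = {"/user/pig/tests/data/singlefile/studenttab10k"}
print(missing_inputs(script, existing))
```

With the script above nothing is reported as missing, since 'bla' is stored before it is loaded. A load of 'bla' with no preceding store would still be flagged.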

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

