pig-dev mailing list archives

From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-789) coupling load and store in script no longer works
Date Wed, 29 Apr 2009 01:28:30 GMT
coupling load and store in script no longer works

                 Key: PIG-789
                 URL: https://issues.apache.org/jira/browse/PIG-789
             Project: Pig
          Issue Type: Bug
          Components: impl
    Affects Versions: 0.3.0
            Reporter: Alan Gates

Many users' Pig scripts do something like this:

a = load '/user/pig/tests/data/singlefile/studenttab10k' as (name, age, gpa);
c = filter a by age > 500;
e = group c by (name, age);
f = foreach e generate group, COUNT($1);
store f into 'bla';
f1 = load 'bla';
g = order f1 by $1;
dump g;

Since the multi-query phase2 patch went in, this no longer appears to work. You get
an error:

2009-04-28 18:24:50,776 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2100: hdfs://wilbur11.labs.corp.sp1.yahoo.com/user/gates/bla does not exist.

We shouldn't be checking for the existence of 'bla' at this point, because the store
earlier in the script will create it.
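
To make the expected behavior concrete, here is a rough sketch in Python (not Pig's
actual implementation). It assumes a hypothetical representation of the script as an
ordered list of (operation, path) pairs and a hypothetical missing_inputs helper. The
idea is simply that a load path is only reported as missing if no earlier store in the
same script will produce it.

```python
def missing_inputs(statements, exists):
    """Return load paths that neither exist nor are produced by an earlier store.

    statements: ordered list of ("load" | "store", path) pairs. This is a
    hypothetical stand-in for a parsed script, not Pig's internal plan.
    exists: callable that reports whether a path already exists.
    """
    produced = set()  # paths that an earlier store in this script will create
    missing = []
    for op, path in statements:
        if op == "store":
            produced.add(path)
        elif op == "load" and path not in produced and not exists(path):
            missing.append(path)
    return missing


# The load/store sequence from the script above, reduced to its I/O steps.
script = [
    ("load", "/user/pig/tests/data/singlefile/studenttab10k"),
    ("store", "bla"),
    ("load", "bla"),
]
# Assumption for illustration: only the input data set exists beforehand.
on_disk = {"/user/pig/tests/data/singlefile/studenttab10k"}

print(missing_inputs(script, on_disk.__contains__))  # prints []
```

With a check like this, 'bla' would still be reported if the load came before the
store, or if no store for it appeared in the script at all.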

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
