hadoop-pig-dev mailing list archives

From "Xuefu Zhang (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-1654) Pig should check schema alias duplication at any levels.
Date Tue, 28 Sep 2010 23:03:33 GMT
Pig should check schema alias duplication at any levels.

                 Key: PIG-1654
                 URL: https://issues.apache.org/jira/browse/PIG-1654
             Project: Pig
          Issue Type: Bug
            Reporter: Xuefu Zhang
            Assignee: Xuefu Zhang
             Fix For: 0.9.0

Pig accepts the following script as valid, but it should not, because the alias u appears twice inside the tuple a:

A = load 'file' as (a:tuple( u:int, u:bytearray, w:long), b:int, c:chararray);
dump A;

Instead of rejecting the script, Pig tries to launch map/reduce jobs for it.

However, for the following script, Pig correctly reports an error:

A = load 'file' as (a:int, b:long, b:bytearray);
dump A;

Error message is:
2010-09-28 15:53:37,390 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1108: Duplicate schema alias: b in "A"

Thus, Pig checks for duplicate aliases only at the top level of a schema, which is confirmed by looking at
the code. The right behavior is to apply the same check at every nesting level.
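
To illustrate what a check at all levels could look like, here is a minimal sketch in Python. It is not Pig's actual code or schema API: the function name find_duplicate_alias, the list-of-pairs schema representation, and the dotted scope string are all assumptions made for this example. It only shows the idea of repeating the per-level check recursively for nested tuple schemas.

```python
# Hypothetical sketch, not Pig's real schema classes or parser code.
# A schema is modeled as a list of (alias, type) pairs. A nested tuple
# type is modeled as another list of (alias, type) pairs.

def find_duplicate_alias(schema, scope="A"):
    """Return (alias, scope) for the first duplicate alias, or None."""
    seen = set()
    for alias, field_type in schema:
        # Unnamed fields (alias None) cannot collide with anything.
        if alias is not None:
            if alias in seen:
                return (alias, scope)
            seen.add(alias)
        # Apply the same check one level down for nested schemas.
        if isinstance(field_type, list):
            found = find_duplicate_alias(field_type, f"{scope}.{alias}")
            if found is not None:
                return found
    return None


# Schema of the first script: the duplicate u is inside the tuple a.
nested_duplicate = [
    ("a", [("u", "int"), ("u", "bytearray"), ("w", "long")]),
    ("b", "int"),
    ("c", "chararray"),
]

# A schema with a top-level duplicate, the case Pig already rejects.
top_level_duplicate = [("a", "int"), ("b", "long"), ("b", "bytearray")]

# A schema with no duplicates at any level.
valid = [("a", [("u", "int"), ("v", "bytearray")]), ("b", "int")]

print(find_duplicate_alias(nested_duplicate))
print(find_duplicate_alias(top_level_duplicate))
print(find_duplicate_alias(valid))
```

Under this sketch, the same alias may still appear at different levels (for example a top-level b and a b inside a nested tuple), since each level keeps its own set of seen aliases. Whether Pig should allow that is a separate design question not settled here.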

This should be addressed in the new parser.

