pig-dev mailing list archives

From "Eli Finkelshteyn (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-2636) "duplicate uid in schema" error from script that uses multiple FOREACH on same relation
Date Tue, 15 Jan 2013 06:40:13 GMT

    [ https://issues.apache.org/jira/browse/PIG-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13553545#comment-13553545 ]

Eli Finkelshteyn commented on PIG-2636:

Using cogroup instead of join is a workaround for this (i.e. just replace "join" with "cogroup").
You'll need to handle the result set slightly differently, since cogroup returns one bag per
input for each key rather than flat joined rows, but anything possible with a join is also
possible with a cogroup.
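
To illustrate the claim above: in Pig the usual rewrite is a COGROUP followed by a FOREACH
that applies FLATTEN to each bag. Whether that rewrite avoids the uid check on affected
versions is not confirmed here. The sketch below is only a small Python model of the
semantics, not Pig code; the function names and sample rows are made up for illustration.

```python
from collections import defaultdict
from itertools import product


def cogroup(left, right, key):
    """Model of COGROUP: for each key, one bag of rows per input."""
    groups = defaultdict(lambda: ([], []))
    for row in left:
        groups[key(row)][0].append(row)
    for row in right:
        groups[key(row)][1].append(row)
    return dict(groups)


def join_via_cogroup(left, right, key):
    """Model of flattening both bags: the cross product within each group.

    A group whose bag is empty on either side contributes no rows,
    which is what makes the result behave like an inner join.
    """
    out = []
    for _, (left_bag, right_bag) in sorted(cogroup(left, right, key).items()):
        out.extend(l + r for l, r in product(left_bag, right_bag))
    return out


def direct_join(left, right, key):
    """Reference inner join used only for comparison."""
    return [l + r for l in left for r in right if key(l) == key(r)]


# Hypothetical (x, y) rows standing in for B1 and B2.
b1 = [(1, 10), (1, 11), (2, 20)]
b2 = [(1, 10), (2, 20), (3, 30)]


def by_x(row):
    return row[0]


joined = join_via_cogroup(b1, b2, by_x)
```

The extra step compared with a join is the flattening: the cogroup output itself keeps the
rows of each input in separate bags, so keys present in only one input are still visible
until the bags are flattened.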

I'm pretty sure the issue here is the same as the one in https://issues.apache.org/jira/browse/PIG-3020,
which is fixed in 0.11.
> "duplicate uid in schema" error from script that uses multiple FOREACH on same relation
> ---------------------------------------------------------------------------------------
>                 Key: PIG-2636
>                 URL: https://issues.apache.org/jira/browse/PIG-2636
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.9.0
>            Reporter: Pete Gieser
>            Priority: Minor
> The following script:
> A = LOAD 'bug.in' AS a:tuple(x:int, y:int);
> B1 = FOREACH A GENERATE a.x, a.y;
> B2 = FOREACH A GENERATE a.x, a.y;
> C = JOIN B1 BY x, B2 by x;
> yields the following error:
> org.apache.pig.impl.plan.PlanValidationException: ERROR 2270: Logical plan invalid state:
> duplicate uid in schema : B1::x#35:int,B1::y#36:int,B2::x#35:int,B2::y#36:int

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
