hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1048) inner join using 'skewed' produces multiple rows for keys with single row in both input relations
Date Wed, 04 Nov 2009 01:37:32 GMT

    [ https://issues.apache.org/jira/browse/PIG-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12773333#action_12773333
] 

Alan Gates commented on PIG-1048:
---------------------------------

When attempting to apply this patch to the 0.5 branch, I got the following error:

Testcase: testSkewedJoinOneValue took 145.739 sec
    Caused an ERROR
Unable to open iterator for alias E
org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for
alias E
    at org.apache.pig.PigServer.openIterator(PigServer.java:475)
    at org.apache.pig.test.TestSkewedJoin.testSkewedJoinOneValue(TestSkewedJoin.java:340)
Caused by: org.apache.pig.backend.executionengine.ExecException: ERROR 2997: Unable to recreate
exception from backed error: java.lang.RuntimeException: Error in configuring object
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.Launcher.getErrorMessages(Launcher.java:237)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.Launcher.getStats(Launcher.java:181)
    at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:209)
    at org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.execute(HExecutionEngine.java:265)
    at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:773)
    at org.apache.pig.PigServer.store(PigServer.java:522)
    at org.apache.pig.PigServer.openIterator(PigServer.java:458)

> inner join using 'skewed' produces multiple rows for keys with single row in both input
relations
> -------------------------------------------------------------------------------------------------
>
>                 Key: PIG-1048
>                 URL: https://issues.apache.org/jira/browse/PIG-1048
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Thejas M Nair
>            Assignee: Sriranjan Manjunath
>             Fix For: 0.6.0
>
>         Attachments: pig_1048.patch
>
>
> ${code}
> grunt> cat students.txt                           
> asdfxc  M       23      12.44
> qwer    F       21      14.44
> uhsdf   M       34      12.11
> zxldf   M       21      12.56
> qwer    F       23      145.5
> oiue    M       54      23.33
>  l1 = load 'students.txt';            
> l2 = load 'students.txt';                  
> j = join l1 by $0, l2 by $0 ; 
> store j into 'tmp.txt'             
> grunt> cat tmp.txt
> oiue    M       54      23.33   oiue    M       54      23.33
> oiue    M       54      23.33   oiue    M       54      23.33
> qwer    F       21      14.44   qwer    F       21      14.44
> qwer    F       21      14.44   qwer    F       23      145.5
> qwer    F       23      145.5   qwer    F       21      14.44
> qwer    F       23      145.5   qwer    F       23      145.5
> uhsdf   M       34      12.11   uhsdf   M       34      12.11
> uhsdf   M       34      12.11   uhsdf   M       34      12.11
> zxldf   M       21      12.56   zxldf   M       21      12.56
> zxldf   M       21      12.56   zxldf   M       21      12.56
> asdfxc  M       23      12.44   asdfxc  M       23      12.44
> asdfxc  M       23      12.44   asdfxc  M       23      12.44$
> ${code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message