hadoop-pig-dev mailing list archives

From "Santhosh Srinivasan (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-306) count with multiple group by keys fails
Date Wed, 23 Jul 2008 00:57:31 GMT

     [ https://issues.apache.org/jira/browse/PIG-306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Santhosh Srinivasan updated PIG-306:
------------------------------------

    Attachment: cogroup_schema.patch

The patch fixes the following:

1. Cogroup schema computation
2. LOMapLookup fieldSchema computation


Unit test cases that still fail are:

[junit] Running org.apache.pig.test.TestEvalPipeline
[junit] Tests run: 8, Failures: 0, Errors: 1, Time elapsed: 179.904 sec
[junit] Test org.apache.pig.test.TestEvalPipeline FAILED

[junit] Running org.apache.pig.test.TestFilterOpNumeric
[junit] Tests run: 8, Failures: 0, Errors: 1, Time elapsed: 56.124 sec
[junit] Test org.apache.pig.test.TestFilterOpNumeric FAILED

[junit] Running org.apache.pig.test.TestBuiltin
[junit] Tests run: 23, Failures: 1, Errors: 1, Time elapsed: 14.8 sec
[junit] Test org.apache.pig.test.TestBuiltin FAILED

[junit] Running org.apache.pig.test.TestStoreOld
[junit] Tests run: 3, Failures: 0, Errors: 2, Time elapsed: 21.453 sec
[junit] Test org.apache.pig.test.TestStoreOld FAILED

> count with multiple group by keys fails
> ---------------------------------------
>
>                 Key: PIG-306
>                 URL: https://issues.apache.org/jira/browse/PIG-306
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: types_branch
>            Reporter: Alan Gates
>            Assignee: Santhosh Srinivasan
>             Fix For: types_branch
>
>         Attachments: cogroup_schema.patch
>
>
> The query:
> {code}
> a = load 'myfile' as (name, age, gpa);
> b = group a by (name, age);
> c = foreach b generate group.name, group.age, COUNT(a.gpa);
> store c into 'outfile';
> {code}
> generates
> 07-12 16:55:54,348 [main] ERROR org.apache.pig.impl.mapReduceLayer.Launcher - Error message from task (reduce) tip_200807090821_0580_r_000000 java.lang.ClassCastException: org.apache.pig.data.DataByteArray cannot be cast to org.apache.pig.data.Tuple
> at org.apache.pig.impl.physicalLayer.expressionOperators.POProject.getNext(POProject.java:262)
> at org.apache.pig.impl.physicalLayer.relationalOperators.POForEach.processPlan(POForEach.java:189)
> at org.apache.pig.impl.physicalLayer.relationalOperators.POForEach.getNext(POForEach.java:148)
> at org.apache.pig.impl.mapReduceLayer.PigMapReduce$Reduce.reduce(PigMapReduce.java:164)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:333)
> at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2071)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

