hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amir Youssefi (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-324) Combiner Error
Date Fri, 18 Jul 2008 02:09:31 GMT
Combiner Error

                 Key: PIG-324
                 URL: https://issues.apache.org/jira/browse/PIG-324
             Project: Pig
          Issue Type: Bug
         Environment: Pig + Hadoop 17 
            Reporter: Amir Youssefi

A = load '...' USING PigStorage('\t') AS (c1, c2, c3, n1);
B = group A by (c1,c2,c3);
C = foreach B generate flatten(group), SUM(A.n1);
store C into ...;

Runs with combiner and errors out. 

java.io.IOException: For input string: "..." additional info: iteration = 1bag size = 2 partial
sum = 0.0
previous tupple = (...)
 at org.apache.pig.builtin.SUM.sum(SUM.java:95)
 at org.apache.pig.builtin.SUM$Final.exec(SUM.java:63)
 at org.apache.pig.builtin.SUM$Final.exec(SUM.java:60)
 at org.apache.pig.impl.eval.FuncEvalSpec$1.add(FuncEvalSpec.java:116)
 at org.apache.pig.impl.eval.GenerateSpec$CrossProductItem.<init>(GenerateSpec.java:159)
 at org.apache.pig.impl.eval.GenerateSpec$1.add(GenerateSpec.java:79)
 at org.apache.pig.backend.hadoop.executionengine.mapreduceExec.PigMapReduce.reduce(PigMapReduce.java:165)
 at org.apache.pig.backend.hadoop.executionengine.mapreduceExec.PigMapReduce.reduce(PigMapReduce.java:80)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:391)
 at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2124)

Work-around was that I put out combiner: 

C = foreach B generate SUM(A.n1),flatten(group);

and it worked. Input data has some private information in it so I cannot post it. Let me know
if it was not possible to solve it without having it. Then we compile a similar input. 

c1,c2,c3 are alphabetic, 
n1 is numeric.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message