pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Koji Noguchi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-5224) Extra foreach from ColumnPrune preventing Accumulator usage
Date Thu, 25 May 2017 16:15:04 GMT

    [ https://issues.apache.org/jira/browse/PIG-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16024946#comment-16024946
] 

Koji Noguchi commented on PIG-5224:
-----------------------------------

bq. That's only if user write "foreach" statement carefully. If he project a column but never
used in the script, Column pruner will still think this is a column should remove.

Ah, you're right (as always) :)    
Committing pig-5224-v2.patch.

> Extra foreach from ColumnPrune preventing Accumulator usage
> -----------------------------------------------------------
>
>                 Key: PIG-5224
>                 URL: https://issues.apache.org/jira/browse/PIG-5224
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Koji Noguchi
>            Assignee: Koji Noguchi
>         Attachments: pig-5224-v0-testonly.patch, pig-5224-v1.patch, pig-5224-v2.patch
>
>
> {code}
> A = load 'input' as (id:int, fruit);
> B = foreach A generate id; -- to enable columnprune
> C = group B by id;
> D = foreach C {
>     o = order B by id;
>     generate org.apache.pig.test.utils.AccumulatorBagCount(o);
> }
> STORE D into ...
> {code}
> Pig fails to use Accumulator interface for this UDF.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message