hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1210) fieldsToRead send the same fields more than once in some cases
Date Sat, 30 Jan 2010 02:17:34 GMT

    [ https://issues.apache.org/jira/browse/PIG-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12806610#action_12806610
] 

Daniel Dai commented on PIG-1210:
---------------------------------

List is the data structure needed for the construct of RequiredFields. Yes, we could Set,
but we need to check if any of our code assume the order within the list, since if we use
Set, we lose the order. We can think about that in the new logical plan.

> fieldsToRead send the same fields more than once in some cases
> --------------------------------------------------------------
>
>                 Key: PIG-1210
>                 URL: https://issues.apache.org/jira/browse/PIG-1210
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: 0.6.0
>
>         Attachments: PIG-1210-1.patch, PIG-1210-2.patch
>
>
> This bug will happen if the following condition meet:
> 1. LoadFunc is susceptible to duplicated fields in fieldsToRead. The only LoadFunc we
notice now is Zebra.
> 2. The first item in FOREACH statement contains reference to the same input more than
once.
> For example, the following script will be affected:
> a = load '11' using org.apache.hadoop.zebra.pig.TableLoader('a0');
> b = foreach a generate a0+a0;

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message