pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-3789) tuple in POStream binaryInputQueue keep changing
Date Wed, 19 Mar 2014 20:55:43 GMT

    [ https://issues.apache.org/jira/browse/PIG-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13940973#comment-13940973
] 

Daniel Dai commented on PIG-3789:
---------------------------------

That changes MR code. In MR, we don't need a copy in Tuple.readFields, since it will be copied
in POPackage. It seems we will now add one more ArrayList creation for every input record
in MR, right?

> tuple in POStream binaryInputQueue keep changing
> ------------------------------------------------
>
>                 Key: PIG-3789
>                 URL: https://issues.apache.org/jira/browse/PIG-3789
>             Project: Pig
>          Issue Type: Sub-task
>          Components: tez
>    Affects Versions: tez-branch
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: tez-branch
>
>         Attachments: PIG-3789-1.patch, PIG-3789-2.patch
>
>
> Similar to the comments in POSimpleTezLoad:
> {code}
>     /**
>      * Previously, we reused the same Result object for all results, but we found
>      * certain operators (e.g. POStream) save references to the Result object and
>      * expect it to be constant.
>      */
> {code}
> Tuples put into binaryInputQueue get changed when it is actually processed. Not exactly
sure why, but make a copy of the tuple solves the issue.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message