hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] Closed: (PIG-1230) Streaming input in POJoinPackage should use nonspillable bag to collect tuples
Date Fri, 14 May 2010 06:47:56 GMT

     [ https://issues.apache.org/jira/browse/PIG-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Daniel Dai closed PIG-1230.
---------------------------


> Streaming input in POJoinPackage should use nonspillable bag to collect tuples
> ------------------------------------------------------------------------------
>
>                 Key: PIG-1230
>                 URL: https://issues.apache.org/jira/browse/PIG-1230
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.6.0
>            Reporter: Ashutosh Chauhan
>            Assignee: Ashutosh Chauhan
>             Fix For: 0.7.0
>
>         Attachments: pig-1230.patch, pig-1230_1.patch, pig-1230_2.patch
>
>
> Last table of join statement is streamed through instead of collecting all its tuple
in a bag. As a further optimization of that, tuples of that relation are collected in chunks
in a bag. Since we don't want to spill the tuples from this bag, NonSpillableBag should be
used to hold tuples for this relation. Initially, DefaultDataBag was used, which was later
changed to InternalCachedBag as a part of PIG-1209.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message