hadoop-pig-dev mailing list archives

From "Ying He (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1037) better memory layout and spill for sorted and distinct bags
Date Mon, 26 Oct 2009 22:40:59 GMT

    [ https://issues.apache.org/jira/browse/PIG-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770246#action_12770246 ]

Ying He commented on PIG-1037:

Alan, thanks for the feedback.

For the average size calculation, I think the cost of computing it 100 times should be
minimal, with no noticeable performance impact, so I'd like to keep it logically correct.
Very big tuples are possible, such as those with Map-type fields.
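
To illustrate why the per-tuple update is cheap, here is a minimal sketch of a running average taken over the first 100 tuples. It is not the code in the patch: the class name, the method names, and the SAMPLE_LIMIT constant are hypothetical, and sizes are passed in as plain byte counts rather than read from real tuples.

```java
// Hypothetical sketch, not the PIG-1037 patch: a running mean of tuple sizes
// that is updated only for the first SAMPLE_LIMIT tuples.
class AvgTupleSizeSketch {
    // Assumed sample limit, based on the "100 times" mentioned above.
    static final int SAMPLE_LIMIT = 100;

    private long sampled = 0;      // number of tuples measured so far
    private double avgSize = 0.0;  // running mean of measured sizes, in bytes

    // Fold one tuple size into the running mean; a no-op once the limit is reached.
    void observe(long tupleSizeBytes) {
        if (sampled >= SAMPLE_LIMIT) {
            return;
        }
        sampled++;
        // Incremental mean: one subtraction, one division, one addition per call.
        avgSize += (tupleSizeBytes - avgSize) / sampled;
    }

    double averageSize() {
        return avgSize;
    }

    // Estimate the memory held by a bag of tupleCount tuples.
    long estimateTotalBytes(long tupleCount) {
        return Math.round(avgSize * tupleCount);
    }

    public static void main(String[] args) {
        AvgTupleSizeSketch sketch = new AvgTupleSizeSketch();
        for (int i = 0; i < 99; i++) {
            sketch.observe(100);       // ordinary small tuples
        }
        sketch.observe(1_000_000);     // one very big tuple, e.g. with a Map field
        System.out.println("average size: " + sketch.averageSize());
        System.out.println("estimate for 1000 tuples: " + sketch.estimateTotalBytes(1000));
    }
}
```

Each update is a few arithmetic operations, so repeating it 100 times costs very little, and a single very big tuple among the samples still moves the estimate.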

For the comments and synchronization, I am going to make the change.

> better memory layout and spill for sorted and distinct bags
> -----------------------------------------------------------
>                 Key: PIG-1037
>                 URL: https://issues.apache.org/jira/browse/PIG-1037
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Olga Natkovich
>            Assignee: Ying He
>         Attachments: PIG-1037.patch, PIG-1037.patch2

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
