hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pradeep Kamath (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-465) PERFORMANCE: removing keys from the value
Date Thu, 09 Oct 2008 00:33:44 GMT

     [ https://issues.apache.org/jira/browse/PIG-465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Pradeep Kamath updated PIG-465:

    Attachment: PIG-465.patch

> PERFORMANCE: removing keys from the value
> -----------------------------------------
>                 Key: PIG-465
>                 URL: https://issues.apache.org/jira/browse/PIG-465
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: types_branch
>            Reporter: Olga Natkovich
>            Assignee: Pradeep Kamath
>             Fix For: types_branch
>         Attachments: PIG-465.patch
> Currently, reducers get the key data twice: once in the key and once in the value. If
grouping key is the large part of the value, this causes large data replication and performance
> The key should not be sent as part of the value. Instead, a metadata should used to assist
in reconstructing the row from the key and the remaining data

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message