hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-8) Use binary comparison in data types
Date Fri, 16 Nov 2007 23:31:43 GMT

     [ https://issues.apache.org/jira/browse/PIG-8?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Olga Natkovich updated PIG-8:

    Attachment: BinaryCompare.patch

Please, see attached changes to fix this issue. All unit tests passed and 80 end-to-end tests
passed as well. I am seeing 40% performance improvement on the group by test.

> Use binary comparison in data types
> -----------------------------------
>                 Key: PIG-8
>                 URL: https://issues.apache.org/jira/browse/PIG-8
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>            Reporter: Olga Natkovich
>            Assignee: Olga Natkovich
>         Attachments: BinaryCompare.patch
> Currently, each data type in Pig has compareTo function as the means to compare objects
during maps side sort. This is very inefficient since 2 objects need to be created for each
comparison. Using binary comparison should speed things up significantly(2-3x) for most queries.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message