pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rohini Palaniswamy (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PIG-4657) [Pig on Tez] Optimize GroupBy and Distinct key comparison
Date Thu, 13 Aug 2015 19:48:46 GMT

     [ https://issues.apache.org/jira/browse/PIG-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Rohini Palaniswamy updated PIG-4657:
------------------------------------
    Attachment: PIG-4657-1.patch

> [Pig on Tez] Optimize GroupBy and Distinct key comparison
> ---------------------------------------------------------
>
>                 Key: PIG-4657
>                 URL: https://issues.apache.org/jira/browse/PIG-4657
>             Project: Pig
>          Issue Type: Sub-task
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4657-1.patch
>
>
>    While bytes comparator cannot be used for joins till TEZ-2715 is available, they can
be used for group by and distinct if they have only one Tez input. If there is more than one
input due to union optimization (OrderedGroupedMergedKVInput) , full comparator has to be
still used as OrderedGroupedMergedKVInput uses the comparator to merge the two underlying
inputs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message