Return-Path: X-Original-To: apmail-pig-dev-archive@www.apache.org Delivered-To: apmail-pig-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2F5371882A for ; Wed, 12 Aug 2015 23:56:46 +0000 (UTC) Received: (qmail 65722 invoked by uid 500); 12 Aug 2015 23:56:45 -0000 Delivered-To: apmail-pig-dev-archive@pig.apache.org Received: (qmail 65665 invoked by uid 500); 12 Aug 2015 23:56:45 -0000 Mailing-List: contact dev-help@pig.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@pig.apache.org Delivered-To: mailing list dev@pig.apache.org Received: (qmail 65651 invoked by uid 500); 12 Aug 2015 23:56:45 -0000 Delivered-To: apmail-hadoop-pig-dev@hadoop.apache.org Received: (qmail 65647 invoked by uid 99); 12 Aug 2015 23:56:45 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Aug 2015 23:56:45 +0000 Date: Wed, 12 Aug 2015 23:56:45 +0000 (UTC) From: "Rohini Palaniswamy (JIRA)" To: pig-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (PIG-4657) [Pig on Tez] Optimize GroupBy and Distinct key comparison MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Rohini Palaniswamy created PIG-4657: --------------------------------------- Summary: [Pig on Tez] Optimize GroupBy and Distinct key comparison Key: PIG-4657 URL: https://issues.apache.org/jira/browse/PIG-4657 Project: Pig Issue Type: Sub-task Reporter: Rohini Palaniswamy Assignee: Rohini Palaniswamy Fix For: 0.16.0 While bytes comparator cannot be used for joins till TEZ-2715 is available, they can be used for group by and distinct if they have only one Tez input. If there is more than one input due to union optimization (OrderedGroupedMergedKVInput) , full comparator has to be still used as OrderedGroupedMergedKVInput uses the comparator to merge the two underlying inputs. -- This message was sent by Atlassian JIRA (v6.3.4#6332)