Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 8C487200B5E for ; Wed, 27 Jul 2016 01:05:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 8AF0D160AA5; Tue, 26 Jul 2016 23:05:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0353A160AA4 for ; Wed, 27 Jul 2016 01:05:21 +0200 (CEST) Received: (qmail 48187 invoked by uid 500); 26 Jul 2016 23:05:20 -0000 Mailing-List: contact dev-help@pig.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@pig.apache.org Delivered-To: mailing list dev@pig.apache.org Received: (qmail 47929 invoked by uid 500); 26 Jul 2016 23:05:20 -0000 Delivered-To: apmail-hadoop-pig-dev@hadoop.apache.org Received: (qmail 47897 invoked by uid 99); 26 Jul 2016 23:05:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jul 2016 23:05:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 99E712C0D65 for ; Tue, 26 Jul 2016 23:05:20 +0000 (UTC) Date: Tue, 26 Jul 2016 23:05:20 +0000 (UTC) From: "Rohini Palaniswamy (JIRA)" To: pig-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (PIG-4958) Tez autoparallelism estimation for order by is higher than mapreduce MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 26 Jul 2016 23:05:22 -0000 [ https://issues.apache.org/jira/browse/PIG-4958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-4958: ------------------------------------ Attachment: (was: PIG-4958-1.patch) > Tez autoparallelism estimation for order by is higher than mapreduce > -------------------------------------------------------------------- > > Key: PIG-4958 > URL: https://issues.apache.org/jira/browse/PIG-4958 > Project: Pig > Issue Type: Bug > Reporter: Rohini Palaniswamy > Assignee: Rohini Palaniswamy > Fix For: 0.17.0 > > Attachments: PIG-4958-withoutsecurity.patch > > > The input size is calculated from the size of the samples in memory. Size in memory is usually 4x or more than the serialized size. Mapreduce estimates the number of reducers based on serialized size. -- This message was sent by Atlassian JIRA (v6.3.4#6332)