Return-Path:
X-Original-To: apmail-hive-dev-archive@www.apache.org
Delivered-To: apmail-hive-dev-archive@www.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B88FB107D1 for ; Wed, 20 Nov 2013 01:04:17 +0000 (UTC)
Received: (qmail 38087 invoked by uid 500); 20 Nov 2013 01:04:17 -0000
Delivered-To: apmail-hive-dev-archive@hive.apache.org
Received: (qmail 38034 invoked by uid 500); 20 Nov 2013 01:04:17 -0000
Mailing-List: contact dev-help@hive.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@hive.apache.org
Delivered-To: mailing list dev@hive.apache.org
Received: (qmail 38025 invoked by uid 500); 20 Nov 2013 01:04:17 -0000
Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org
Received: (qmail 38020 invoked by uid 99); 20 Nov 2013 01:04:17 -0000
Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 20 Nov 2013 01:04:17 +0000
Date: Wed, 20 Nov 2013 01:04:17 +0000 (UTC)
From: "Prasanth J (JIRA)"
To: hive-dev@hadoop.apache.org
Message-ID:
In-Reply-To:
References:
Subject: [jira] [Updated] (HIVE-5849) Improve the stats of operators based on heuristics in the absence of any column statistics
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

     [ https://issues.apache.org/jira/browse/HIVE-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Prasanth J updated HIVE-5849:
-----------------------------
    Attachment: HIVE-5849.2.patch.txt

Regenerated output for more test cases.

> Improve the stats of operators based on heuristics in the absence of any column statistics
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-5849
>                 URL: https://issues.apache.org/jira/browse/HIVE-5849
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor, Statistics
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5849.1.patch.txt, HIVE-5849.2.patch.txt
>
>
> In the absence of any column statistics, operators simply use the statistics from their parents. It is useful to apply some heuristics to update the basic statistics (number of rows and data size) in the absence of any column statistics. This will be the worst-case scenario.

--
This message was sent by Atlassian JIRA
(v6.1#6144)