hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth J (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-5849) Improve the stats of operators based on heuristics in the absence of any column statistics
Date Wed, 20 Nov 2013 18:24:41 GMT

     [ https://issues.apache.org/jira/browse/HIVE-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Prasanth J updated HIVE-5849:
-----------------------------

    Attachment: HIVE-5849.3.patch.txt

Fixed failing test case bucketmapjoin7.q in TestMinimrCliDriver.

> Improve the stats of operators based on heuristics in the absence of any column statistics
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-5849
>                 URL: https://issues.apache.org/jira/browse/HIVE-5849
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor, Statistics
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5849.1.patch.txt, HIVE-5849.2.patch.txt, HIVE-5849.3.patch.txt
>
>
> In the absence of any column statistics, operators will simply use the statistics from
its parents. It is useful to apply some heuristics to update basic statistics (number of rows
and data size) in the absence of any column statistics. This will be worst case scenario.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message