hive-issues mailing list archives

From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17934) Merging Statistics are promoted to COMPLETE (most of the time)
Date Thu, 09 Nov 2017 00:00:01 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16244958#comment-16244958 ]

Ashutosh Chauhan commented on HIVE-17934:
-----------------------------------------

I see changes that I am not sure we want. For example, if two TS (TableScan) operators have
COMPLETE basic stats, the stats for the join (or group by) following them become PARTIAL after
this patch. Is there a reason for this change?

> Merging Statistics are promoted to COMPLETE (most of the time)
> --------------------------------------------------------------
>
>                 Key: HIVE-17934
>                 URL: https://issues.apache.org/jira/browse/HIVE-17934
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>            Reporter: Zoltan Haindrich
>            Assignee: Zoltan Haindrich
>         Attachments: HIVE-17934.01.patch, HIVE-17934.02.patch, HIVE-17934.03.patch, HIVE-17934.04.patch, HIVE-17934.05.patch, HIVE-17934.06wip01.patch
>
>
> When multiple partition statistics are merged, the STATS state is computed from the
> merged datasize and rowcount.
> The merge may therefore hide missing stats whenever other partitions or operators
> do contribute to the datasize and the rowcount.
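
The problem in the issue description can be illustrated with a small sketch. This is
not Hive's code: `State`, `PartStats`, `state_from_totals` and `state_from_inputs` are
hypothetical names, and the rules below are only an assumed reading of the description.
It contrasts deriving the state from the merged totals with deriving it from the state
of each input.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class State(Enum):
    NONE = 0
    PARTIAL = 1
    COMPLETE = 2


@dataclass
class PartStats:
    """Hypothetical basic stats of one partition; None means the value is missing."""
    num_rows: Optional[int]
    data_size: Optional[int]

    @property
    def state(self) -> State:
        known = [v is not None for v in (self.num_rows, self.data_size)]
        if all(known):
            return State.COMPLETE
        return State.PARTIAL if any(known) else State.NONE


def state_from_totals(parts: List[PartStats]) -> State:
    """Assumed old behaviour: look only at the summed rowcount and datasize."""
    rows = sum(p.num_rows or 0 for p in parts)
    size = sum(p.data_size or 0 for p in parts)
    if rows > 0 and size > 0:
        return State.COMPLETE
    return State.PARTIAL if (rows > 0 or size > 0) else State.NONE


def state_from_inputs(parts: List[PartStats]) -> State:
    """Alternative: COMPLETE only if every input is itself COMPLETE."""
    if not parts:
        return State.NONE
    states = {p.state for p in parts}
    if states == {State.COMPLETE}:
        return State.COMPLETE
    if states == {State.NONE}:
        return State.NONE
    return State.PARTIAL


# One partition with stats, one without: the totals hide the missing stats.
mixed = [PartStats(100, 2048), PartStats(None, None)]
print(state_from_totals(mixed))   # State.COMPLETE
print(state_from_inputs(mixed))   # State.PARTIAL

# Two inputs that are both COMPLETE should stay COMPLETE after merging.
both = [PartStats(100, 2048), PartStats(50, 1024)]
print(state_from_inputs(both))    # State.COMPLETE
```

Under these assumptions, the second case is the one raised in the comment: combining
two COMPLETE inputs should not produce PARTIAL.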



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
