hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-11160) Auto-gather column stats
Date Sat, 05 Mar 2016 00:09:40 GMT

    [ https://issues.apache.org/jira/browse/HIVE-11160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15181323#comment-15181323
] 

Ashutosh Chauhan commented on HIVE-11160:
-----------------------------------------

* Can you add explain for insert statements ?
* It seems we run analyze table for all partitions, we should run it only for new partitions
getting generated in the query ?

> Auto-gather column stats
> ------------------------
>
>                 Key: HIVE-11160
>                 URL: https://issues.apache.org/jira/browse/HIVE-11160
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-11160.01.patch, HIVE-11160.02.patch, HIVE-11160.03.patch, HIVE-11160.04.patch,
HIVE-11160.05.patch
>
>
> Hive will collect table stats when set hive.stats.autogather=true during the INSERT OVERWRITE
command. And then the users need to collect the column stats themselves using "Analyze" command.
In this patch, the column stats will also be collected automatically. More specifically, INSERT
OVERWRITE will automatically create new column stats. INSERT INTO will automatically merge
new column stats with existing ones.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message