hive-issues mailing list archives

From "Jesus Camacho Rodriguez (Jira)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-22979) Support total file size in statistics annotation
Date Fri, 06 Mar 2020 17:57:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-22979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17053645#comment-17053645 ]

Jesus Camacho Rodriguez commented on HIVE-22979:
------------------------------------------------

Sounds like a good idea, thanks!

> Support total file size in statistics annotation
> ------------------------------------------------
>
>                 Key: HIVE-22979
>                 URL: https://issues.apache.org/jira/browse/HIVE-22979
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 4.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: HIVE-22979.1.patch, HIVE-22979.2.patch
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> Hive statistics annotation provides estimated Statistics for each operator. The data size
> provided in TableScanOperator is the raw data size (after decompression and decoding), but
> some optimizations, such as scan cost estimation, can be performed based on the total file
> size on disk.
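
To illustrate the distinction drawn in the description, here is a minimal sketch of a scan cost estimate that prefers the on-disk file size over the raw data size. Every name in it (ScanSizeSketch, ScanStats, scanCostBytes) is hypothetical and is not taken from Hive's code base or from the attached patches; the fallback rule is likewise an assumption made for illustration.

```java
// Hypothetical sketch: none of these names come from Hive's code base.
public final class ScanSizeSketch {

    /** Sizes attached to a table scan; both fields are illustrative. */
    public static final class ScanStats {
        final long rawDataSize;   // bytes after decompression and decoding
        final long totalFileSize; // bytes as stored on disk; 0 means unknown

        public ScanStats(long rawDataSize, long totalFileSize) {
            this.rawDataSize = rawDataSize;
            this.totalFileSize = totalFileSize;
        }
    }

    /**
     * Estimated bytes read by a scan: the on-disk size when it is known,
     * otherwise the raw data size as a fallback (assumed rule).
     */
    public static long scanCostBytes(ScanStats stats) {
        return stats.totalFileSize > 0 ? stats.totalFileSize : stats.rawDataSize;
    }

    public static void main(String[] args) {
        // A compressed table: 10 GB raw, 1.2 GB on disk.
        ScanStats compressed = new ScanStats(10_000_000_000L, 1_200_000_000L);
        System.out.println(scanCostBytes(compressed));
    }
}
```

For a well-compressed columnar table the two sizes can differ by a large factor, which is why a cost model based only on raw data size may overestimate how much I/O a scan performs.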



--
This message was sent by Atlassian Jira
(v8.3.4#803005)