hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-19684) Hive stats optimizer wrongly uses stats against non native tables
Date Sat, 26 May 2018 16:37:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-19684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ashutosh Chauhan updated HIVE-19684:
------------------------------------
       Resolution: Fixed
    Fix Version/s: 4.0.0
           Status: Resolved  (was: Patch Available)

Pushed to master. Thanks, Slim!
cc: [~vgarg] Please consider this for branch-3.

> Hive stats optimizer wrongly uses stats against non native tables
> -----------------------------------------------------------------
>
>                 Key: HIVE-19684
>                 URL: https://issues.apache.org/jira/browse/HIVE-19684
>             Project: Hive
>          Issue Type: Bug
>          Components: Druid integration, Physical Optimizer
>            Reporter: slim bouguerra
>            Assignee: slim bouguerra
>            Priority: Blocker
>             Fix For: 4.0.0
>
>         Attachments: HIVE-19684.2.patch, HIVE-19684.3.patch, HIVE-19684.patch
>
>
> Stats of non native tables are inaccurate, thus queries over non native tables can not
optimized by stats optimizer.
> Take example of query 
> {code}
> Explain select count(*) from (select `__time` from druid_test_table limit 1) as src ;
> {code} 
> the plan will be reduced to 
> {code}
> POSTHOOK: query: explain extended select count(*) from (select `__time` from druid_test_table
limit 1) as src
> POSTHOOK: type: QUERY
> STAGE DEPENDENCIES:
>   Stage-0 is a root stage
> STAGE PLANS:
>   Stage: Stage-0
>     Fetch Operator
>       limit: 1
>       Processor Tree:
>         ListSink
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message