hive-issues mailing list archives

From "Jesus Camacho Rodriguez (Jira)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-22263) MV has distinct on columns and query has count(distinct) on one of the columns, we do not trigger rewriting
Date Wed, 12 Feb 2020 05:52:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-22263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jesus Camacho Rodriguez updated HIVE-22263:
-------------------------------------------
    Attachment: HIVE-22263.patch

> MV has distinct on columns and query has count(distinct) on one of the columns, we do not trigger rewriting
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-22263
>                 URL: https://issues.apache.org/jira/browse/HIVE-22263
>             Project: Hive
>          Issue Type: Sub-task
>          Components: CBO, Materialized views
>    Affects Versions: 3.1.2
>            Reporter: Steve Carlin
>            Assignee: Jesus Camacho Rodriguez
>            Priority: Critical
>         Attachments: HIVE-22263.patch, count-distinct.sql, count-distinct2.sql
>
>
> Count distinct issues with materialized views. Two scripts are attached:
> 1) 
> create materialized view base_aview stored as orc as select distinct c1 c1, c2 c2 from base;
> explain extended select count(distinct c1) from base group by c2;
> 2)
> create materialized view base_aview stored as orc as SELECT c1 c1, c2 c2, sum(c2) FROM base group by 1,2;
> explain extended select count(distinct c1) from base group by c2;
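
For readers of the archive, the following is a minimal sketch of why a rewrite looks possible for script 1. It is not taken from the patch or from Hive's optimizer. It assumes SQLite as a stand-in (SQLite has no materialized views, so a plain table plays the role of base_aview), invented sample rows, and a hypothetical rewritten query. Because the view already holds distinct (c1, c2) pairs, counting non-null c1 per c2 in the view should match count(distinct c1) per c2 on the base table. The c2 column is added to the select list only to make the two results comparable.

```python
import sqlite3

# Stand-in for Hive: SQLite has no materialized views, so a plain table
# plays the role of base_aview. The sample rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE base (c1 INTEGER, c2 INTEGER);
    INSERT INTO base VALUES
        (1, 10), (1, 10), (2, 10), (3, 20), (3, 20), (NULL, 20);
    -- Same SELECT as the materialized view definition in script 1.
    CREATE TABLE base_aview AS SELECT DISTINCT c1 AS c1, c2 AS c2 FROM base;
""")

# Query from the report, evaluated directly on the base table.
original = conn.execute(
    "SELECT c2, COUNT(DISTINCT c1) FROM base GROUP BY c2 ORDER BY c2"
).fetchall()

# Hypothetical rewrite (an assumption, not Hive's actual plan): the view
# stores each (c1, c2) pair once, so a plain COUNT(c1) is enough.
rewritten = conn.execute(
    "SELECT c2, COUNT(c1) FROM base_aview GROUP BY c2 ORDER BY c2"
).fetchall()

print(original)
print(rewritten)
```

Both queries return the same rows on this data, including the group that contains a NULL c1, since COUNT ignores NULLs in either form.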



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
