hive-issues mailing list archives

From "Pengcheng Xiong (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-15685) count(distinct) generates different result than expected
Date Fri, 20 Jan 2017 22:24:26 GMT

     [ https://issues.apache.org/jira/browse/HIVE-15685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pengcheng Xiong updated HIVE-15685:
-----------------------------------
    Attachment: HIVE-15685.01.patch

> count(distinct) generates different result than expected
> --------------------------------------------------------
>
>                 Key: HIVE-15685
>                 URL: https://issues.apache.org/jira/browse/HIVE-15685
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-15685.01.patch
>
>
> The following query with count(distinct) produces a different result than expected on Hive master:
> {noformat}
> select count(distinct ss_ticket_number), count(distinct ss_sold_date_sk) from store_sales;
> {noformat}
> Expected output, generated using PostgreSQL:
> {noformat}
> select count(distinct ss_ticket_number), count(distinct ss_sold_date_sk) from store_sales;
>  count  | count 
> --------+-------
>  240000 |  1823
> (1 row)
> {noformat}
> Actual output on Hive:
> {noformat}
> 240000	1824
> {noformat}
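
The report does not state the cause of the extra value. One way an off-by-one such as 1823 versus 1824 can arise in general is a NULL being counted as a distinct value, whereas standard SQL count(distinct col) skips NULLs. The sketch below only illustrates that semantics with Python's built-in sqlite3 module; the four sample rows are made up, and nothing here is taken from the attached patch.

```python
import sqlite3

# Hypothetical miniature of store_sales; these rows are invented for illustration.
rows = [(1, 10), (2, 10), (3, 20), (4, None)]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE store_sales (ss_ticket_number INTEGER, ss_sold_date_sk INTEGER)"
)
conn.executemany("INSERT INTO store_sales VALUES (?, ?)", rows)

# Standard SQL semantics: count(distinct col) ignores NULL values.
sql_counts = conn.execute(
    "SELECT count(DISTINCT ss_ticket_number), count(DISTINCT ss_sold_date_sk) "
    "FROM store_sales"
).fetchone()

# A naive distinct count that treats NULL (None) as a value is one higher.
naive_date_count = len({date_sk for _, date_sk in rows})

print(sql_counts)        # (4, 2)
print(naive_date_count)  # 3
```

Running the same check against the real table (for example, comparing the count with and without a `WHERE ss_sold_date_sk IS NOT NULL` filter) would be one way to test whether this assumption applies here.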



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
