spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yin Huai (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-10746) count ( distinct columnref) over () returns wrong result set
Date Wed, 18 Nov 2015 23:50:11 GMT

    [ https://issues.apache.org/jira/browse/SPARK-10746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15012395#comment-15012395
] 

Yin Huai commented on SPARK-10746:
----------------------------------

btw, for now, distinct aggregation in window function is not supported (Hive will silently
drop the distinct keyword). 

also, can you post a case to reproduce the problem? What is the data?

> count ( distinct columnref) over () returns wrong result set
> ------------------------------------------------------------
>
>                 Key: SPARK-10746
>                 URL: https://issues.apache.org/jira/browse/SPARK-10746
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.5.0
>            Reporter: N Campbell
>
> Same issue as report against HIVE (HIVE-9534) 
> Result set was expected to contain 5 rows instead of 1 row as others vendors (ORACLE,
Netezza etc) would.
> select count( distinct column) over () from t1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message