spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "J.P Feng (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-19037) Run count(distinct name) from sub query found some errors
Date Fri, 30 Dec 2016 13:53:58 GMT

    [ https://issues.apache.org/jira/browse/SPARK-19037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15787714#comment-15787714
] 

J.P Feng commented on SPARK-19037:
----------------------------------

errors logs:

> Run count(distinct name) from sub query found some errors
> ---------------------------------------------------------
>
>                 Key: SPARK-19037
>                 URL: https://issues.apache.org/jira/browse/SPARK-19037
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Shell, SQL
>    Affects Versions: 2.1.0
>         Environment: spark 2.1.0, scala 2.11 
>            Reporter: J.P Feng
>              Labels: distinct, sparkSQL, sub-query
>
> when i use spark-shell or spark-sql to execute count(distinct name) from subquery, some
errors occur:
> select count(distinct name) from (select * from mytest limit 10) as a
> if i do this in hive-server2, i can get the correct result.
> if i just execute select count(name) from (select * from mytest limit 10) as a, i can
also get the right result.
> besides, i found the same errors when i use max(), distinct(),groupby() with subquery.
> I think there maybe some bugs when doing key-reduce jobs with subquery.
> I will add the errors in new comment.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message