spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "J.P Feng (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-19037) Run count(distinct name) from sub query found some errors
Date Fri, 30 Dec 2016 13:53:58 GMT
J.P Feng created SPARK-19037:
--------------------------------

             Summary: Run count(distinct name) from sub query found some errors
                 Key: SPARK-19037
                 URL: https://issues.apache.org/jira/browse/SPARK-19037
             Project: Spark
          Issue Type: Bug
          Components: Spark Shell, SQL
    Affects Versions: 2.1.0
         Environment: spark 2.1.0, scala 2.11 
            Reporter: J.P Feng


when i use spark-shell or spark-sql to execute count(distinct name) from subquery, some errors
occur:

select count(distinct name) from (select * from mytest limit 10) as a

if i do this in hive-server2, i can get the correct result.

if i just execute select count(name) from (select * from mytest limit 10) as a, i can also
get the right result.

besides, i found the same errors when i use max(), distinct(),groupby() with subquery.

I think there maybe some bugs when doing key-reduce jobs with subquery.

I will add the errors in new comment.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message