spark-issues mailing list archives

From Witold Jędrzejewski (JIRA) <j...@apache.org>
Subject [jira] [Created] (SPARK-16683) Group by does not work after multiple joins of the same dataframe
Date Fri, 22 Jul 2016 14:38:20 GMT
Witold Jędrzejewski created SPARK-16683:
-------------------------------------------

             Summary: Group by does not work after multiple joins of the same dataframe
                 Key: SPARK-16683
                 URL: https://issues.apache.org/jira/browse/SPARK-16683
             Project: Spark
          Issue Type: Bug
          Components: Optimizer
    Affects Versions: 1.6.2, 1.6.1, 1.6.0, 2.0.0
         Environment: local and yarn
            Reporter: Witold Jędrzejewski


When I join a DataFrame, group by one of its fields, then join it again on a different field
and group by a field from it, the second aggregation is not triggered.

A detailed description, a minimal example, a workaround, and a possible cause are in the
attachment, in the form of a Zeppelin notebook.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


