spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (SPARK-13213) BroadcastNestedLoopJoin is very slow
Date Thu, 18 Feb 2016 09:59:18 GMT

     [ https://issues.apache.org/jira/browse/SPARK-13213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Apache Spark reassigned SPARK-13213:
------------------------------------

    Assignee:     (was: Apache Spark)

> BroadcastNestedLoopJoin is very slow
> ------------------------------------
>
>                 Key: SPARK-13213
>                 URL: https://issues.apache.org/jira/browse/SPARK-13213
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Davies Liu
>
> Since we have improve the performance of CartisianProduct, which should be faster and
robuster than BroacastNestedLoopJoin, we should do CartisianProduct instead of BroacastNestedLoopJoin,
especially  when the broadcasted table is not that small.
> Today, we hit a query that take very long time but still not finished, once decrease
the threshold for broadcast (disable BroacastNestedLoopJoin), it just finished in seconds.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message