spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "marymwu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-24799) A solution of dealing with data skew in left,right,inner join
Date Fri, 13 Jul 2018 09:11:00 GMT
marymwu created SPARK-24799:
-------------------------------

             Summary: A solution of dealing with data skew in left,right,inner join
                 Key: SPARK-24799
                 URL: https://issues.apache.org/jira/browse/SPARK-24799
             Project: Spark
          Issue Type: New Feature
          Components: SQL
    Affects Versions: 2.3.0, 2.2.0, 2.1.0, 2.0.0
            Reporter: marymwu
             Fix For: 2.3.0


For the left,right,inner join statment execution, this solution is mainling about to devide
the partions where the data skew has occured into serveral partions with smaller data scale,
in order to parallelly execute more tasks to increase effeciency.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message