hive-dev mailing list archives

From "Chao (JIRA)" <>
Subject [jira] [Commented] (HIVE-8913) Make SparkMapJoinResolver handle runtime skew join [Spark Branch]
Date Thu, 11 Dec 2014 03:31:12 GMT


Chao commented on HIVE-8913:

(cc [~jxiang]) Not sure - maybe we can try that.
The interesting thing is that copying the IOContext is supposed to apply only in the
cached-RDD case, but this test doesn't use caching at all, so in {{SparkMapRecordHandler}}
the input path should not be null in the first place.

> Make SparkMapJoinResolver handle runtime skew join [Spark Branch]
> -----------------------------------------------------------------
>                 Key: HIVE-8913
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Spark
>            Reporter: Rui Li
>            Assignee: Rui Li
>         Attachments: HIVE-8913.1-spark.patch, HIVE-8913.2-spark.patch
> Sub-task of HIVE-8406.
> Now we have {{SparkMapJoinResolver}} in place. But at the moment, it doesn't handle the
> map join tasks created by the upstream {{SkewJoinResolver}}, i.e. those wrapped in a
> {{ConditionalTask}}. We have to implement this part for runtime skew join to work on Spark.
> To do so, we can borrow logic from {{MapJoinResolver}}.

This message was sent by Atlassian JIRA
