hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Szehon Ho (JIRA)" <>
Subject [jira] [Commented] (HIVE-8202) Support SMB Join for Hive on Spark [Spark Branch]
Date Fri, 31 Oct 2014 05:23:33 GMT


Szehon Ho commented on HIVE-8202:

Thanks, Xuefu

> Support SMB Join for Hive on Spark [Spark Branch]
> -------------------------------------------------
>                 Key: HIVE-8202
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Xuefu Zhang
>            Assignee: Szehon Ho
>             Fix For: spark-branch
>         Attachments: HIVE-8202.1-spark.patch, HIVE-8202.2-spark.patch, HIVE-8202.3-spark.patch,
HIVE-8202.4-spark.patch, HIVE-8202.5-spark.patch, HIVE-8202.6-spark.patch, HIVE-8202.7-spark.patch,
HIVE-8202.8-spark.patch, HIVE-8202.9-spark.patch, Hive on Spark SMB Join.docx, Hive on Spark
SMB Join.pdf
> SMB joins are used wherever the tables are sorted and bucketed. It's a map-side join.
The join boils down to just merging the already sorted tables, allowing this operation to
be faster than an ordinary map-join.
> The task is to research and support the conversion from regular SMB join to SMB map join
for Spark execution engine.

This message was sent by Atlassian JIRA

View raw message