hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xuefu Zhang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8621) Aggregate all small table join data into broadcast variables
Date Mon, 27 Oct 2014 22:58:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14185971#comment-14185971
] 

Xuefu Zhang commented on HIVE-8621:
-----------------------------------

Hi [~ssatish], to make explicit, if the table is not bucket, which means n=1, we will broadcast
m x 1 variables. Szehon and I was discussing about this, and found that broardcasting m x
n variable is a general case that covers map join for both unbucketed tables and bucketed
tables. This shouldn't impact your overall design and implementation. We are not clear about
how to generate n variables for bucketed table yet, but it seems feasible. Please update the
title of ticket if you agree to this minor change. Sorry for the confusion.

> Aggregate all small table join data into broadcast variables
> ------------------------------------------------------------
>
>                 Key: HIVE-8621
>                 URL: https://issues.apache.org/jira/browse/HIVE-8621
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Suhas Satish
>            Assignee: Suhas Satish
>
> This is a sub-task of map-join for spark 
> https://issues.apache.org/jira/browse/HIVE-7613
> This can use the baseline patch for map-join
> https://issues.apache.org/jira/browse/HIVE-8616



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message