hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lianhui Wang (JIRA)" <>
Subject [jira] [Commented] (HIVE-4506) use one map reduce to join multiple small tables
Date Tue, 07 May 2013 02:19:16 GMT


Lianhui Wang commented on HIVE-4506:

Fern, can you provide your sql?
if these tables used the same column in join clause, it used one mr.
SELECT /*+mapjoin(src2,src3)*/ src1.key, src3.value FROM src src1 JOIN src src2 ON (src1.key
= src2.key) JOIN src src3 ON (src1.key = src3.key);

> use one map reduce to join multiple small tables 
> -------------------------------------------------
>                 Key: HIVE-4506
>                 URL:
>             Project: Hive
>          Issue Type: Wish
>    Affects Versions: 0.10.0
>            Reporter: Fern
>            Priority: Minor
> I know we can use map side join for small table.
> by my test, if I use HQL like this
> ------
> select /*+mapjoin(b,c)*/...
> from a
> left join b
> on ...
> left join c
> on ...
> -------
> b and c are both small tables, I expect do the join in one map reduce using map side
join. Actually, it would generate two map-reduce jobs by sequence.
> Sorry, currently I am just a user of hive and not dig into the code, so this is what
I expect but I have no idea about how to improve now. 

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

View raw message