hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Aniket Mokashi <aniket...@gmail.com>
Subject Re: Map side join
Date Mon, 18 Jun 2012 08:08:25 GMT
Hive also have something called uniquejoin. May be you are looking for
that. I cannot find documentation for your reference but you can do a jira
search.
It allows you to perform joining multiple sources with same key, mapside.
(all sources should have the same key)

~Aniket

On Wed, Jun 13, 2012 at 8:20 AM, Tucker, Matt <Matt.Tucker@disney.com>wrote:

> Hi,
>
> Assuming that 4 tables are small enough to fit in the Distributed Cache,
> the joins between the tables all need to join against a common key.
>
> Example:
> set hive.auto.convert.join=true;
> SELECT *
> FROM large
>    JOIN smalla ON
>        large.key = smalla.key1
>    JOIN smallb ON
>        large.key = smallb.key2
>    JOIN smallc ON
>        large.key = smallc.key3
>    JOIN smalld ON
>        large.key = smalld.key4;
>
> Having a different join key will push the join off into a different task,
> as will the order of the join condition. In this example, large.key was
> always on the left side of the join conditions.
>
>
> Matt Tucker
>
> -----Original Message-----
> From: Abhishek [mailto:abhishek.dodda1@gmail.com]
> Sent: Wednesday, June 13, 2012 11:13 AM
> To: user@hive.apache.org
> Subject: Map side join
>
> Hi all,
>
> How map side join in hive, can be used to join multiple tables(suppose 5
> tables).
>
> Regards
> Abhishek
>
> Sent from my iPhone
>



-- 
"...:::Aniket:::... Quetzalco@tl"

Mime
View raw message