hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From krish ws <krisws.2...@gmail.com>
Subject Re: hive table clustering - question
Date Sun, 27 Apr 2014 00:36:51 GMT
Hi All,
         Can someone provide has any idea on my above question?

Appreciate the help


On Thu, Apr 24, 2014 at 7:15 PM, krish ws <krisws.2006@gmail.com> wrote:

>  Hi,
>       I have a question related to hive table *bucketing* based on
> multiple columns(*Clustered by* on a common set of columns).
>
> How would be the join performance if I am joining this table to itself
> based on few columns that I have specified in *clustered by *condition(not
> all)?
>
> Will the hashing differs based on few columns vs using all columns that I
> specified in the *Clustered by* clause on a table?
>
> Regards
> Krish
>

Mime
View raw message