hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gang Luo <lgpub...@yahoo.com.cn>
Subject skew join in hive
Date Mon, 21 Jun 2010 15:20:17 GMT
Hi,
I see the skew handling strategy as mentioned in hive-964. Here are some questions.
1. how to get the big keys for a table? Launch a mr job to build histogram on each table?
2. now that we get big/skewed keys, do we also have small/non-skewed keys? Do we process these
non-skewed keys in the same way (replicate join), or in the traditional way (redistribution
join)?

Thanks,
-Gang



      

Mime
View raw message