hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/LanguageManual/Joins" by NamitJain
Date Wed, 31 Mar 2010 21:19:56 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/LanguageManual/Joins" page has been changed by NamitJain.


     SELECT a.key, a.val
     FROM a LEFT SEMI JOIN b on (a.key = b.key)
+  * If all but one of the tables being joined are small, the join can be performed as a map
only job. The query  
+ {{{
+   SELECT /*+ MAPJOIN(b) */ a.key, a.value
+   FROM a join b on a.key = b.key
+ }}}
+ does not need a reducer. For every mapper of A, B is read completely. The restriction is
that '''a FULL/RIGHT OUTER JOIN b''' cannot be performed
   * If the tables being joined are bucketized, and the buckets are a multiple of each other,
the buckets can be joined with each other. If table A has 8 buckets are table B has 4 buckets,
the following join  
    SELECT /*+ MAPJOIN(b) */ a.key, a.value

View raw message