hive-dev mailing list archives

From "Remus Rusanu" <>
Subject Re: Review Request 13059: HIVE-4850 Implement vector mode map join
Date Thu, 03 Oct 2013 14:17:56 GMT

This is an automatically generated e-mail. To reply, visit:

(Updated Oct. 3, 2013, 2:17 p.m.)

Review request for hive, Eric Hanson and Jitendra Pandey.

Bugs: HIVE-4850

Repository: hive-git


This is not the final iteration, but I thought it would be easier to discuss it with a review.
This implementation works and handles multiple aliases and multiple values per key. It
uses the existing hash tables saved by the local task for the map join, which are row-mode
hash tables (they have row-mode keys and store row-mode writable object values). Going forward
we should avoid the size-of-big-table conversions of big table keys to row mode and the conversion
of small table values to vector data. This would require either converting the hash tables
on the fly to vector-friendly ones (when loaded) or changing the local task hashtable sink
to create a vectorization-friendly hash table. The first approach may have memory consumption
problems (potentially two hash tables end up in memory, so we would have to stream the
transformation or transform while reading from the serialized format... nasty).
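
To make the per-row conversion cost concrete, here is a minimal sketch of probing a row-mode hash table from a vectorized batch. It is not the code in this patch: the class VectorMapJoinSketch, the method probe, the long[] key column and the Map<Long, List<String>> standing in for the small-table hash table are all hypothetical simplifications, and an inner join is assumed.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from Hive.
public class VectorMapJoinSketch {

    // Probes a row-mode hash table with one batch of big-table keys.
    // keyColumn holds the batch's key column; size is the number of valid rows.
    static List<String> probe(long[] keyColumn, int size,
                              Map<Long, List<String>> smallTable) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < size; i++) {
            // Conversion to a row-mode key: happens once per big-table row.
            Long rowModeKey = Long.valueOf(keyColumn[i]);
            List<String> matches = smallTable.get(rowModeKey);
            if (matches == null) {
                continue; // inner join assumed: unmatched rows are dropped
            }
            // One key may carry several values; each produces an output row.
            for (String value : matches) {
                out.add(keyColumn[i] + ":" + value);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<Long, List<String>> smallTable = new HashMap<>();
        smallTable.computeIfAbsent(1L, k -> new ArrayList<>()).add("a");
        smallTable.computeIfAbsent(1L, k -> new ArrayList<>()).add("b");
        smallTable.computeIfAbsent(3L, k -> new ArrayList<>()).add("c");

        long[] keys = {1L, 2L, 3L};
        System.out.println(probe(keys, keys.length, smallTable));
    }
}
```

The boxing inside the loop is the stand-in for the row-mode key conversion: it is paid once per big-table row, which is the cost a vector-friendly hash table would aim to remove.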

Diffs (updated)

  ql/src/java/org/apache/hadoop/hive/ql/exec/ d320b47 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ 86db044 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ 153b8ea 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ 8ab5395 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ cde1a59 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ 8b4c615 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ 9955d09 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ 6df3551 

  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ 02ebe14 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ ff13f89 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/ df1c5a6 
  ql/src/java/org/apache/hadoop/hive/ql/plan/ a72ec8b 



Manually ran some join queries on the alltypes_orc table.


Remus Rusanu
