hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/Roadmap" by TimArmstrong
Date Sat, 18 Jun 2011 20:10:19 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/Roadmap" page has been changed by TimArmstrong:
http://wiki.apache.org/hadoop/Hive/Roadmap?action=diff&rev1=62&rev2=63

Comment:
removed claim

  Priorities are denoted as P0 > P1 > ...
  
  === Query Optimization ===
-  * [P0] Optimizing JOIN followed by GROUP BY (claimed by Tim Armstrong) 
+  * [P0] Optimizing JOIN followed by GROUP BY
    * A lot of analytics queries are JOINs followed by GROUP BY (join keys and group by keys may or may not be the same or related). We need a better optimization for this kind of query (optimize number of MapReduce Jobs vs. optimize data transfer size etc.)
   * [P0] Optimize JOINs using Bloom Filters
    * This is to optimize the case where two big tables are joined but the results are small.
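
For illustration only, the Bloom filter item above could be sketched as follows. This is not Hive code and not the approach proposed on the roadmap. It is a minimal in-memory Python sketch, and every name in it (BloomFilter, bloom_filtered_join, the row layout as lists of dicts) is a hypothetical chosen for the example. The idea it shows: build a compact filter over the join keys of one table, then use it to discard rows of the other table that cannot possibly match before doing the real join. In a MapReduce setting the filter would presumably be built in one pass and shipped to the tasks scanning the other table; that part is assumed and not shown.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits=1 << 16, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, key):
        # Derive num_hashes bit positions from salted SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )


def bloom_filtered_join(left, right, left_key, right_key):
    """Inner-join two lists of dicts, pre-filtering `right` by the keys of `left`.

    Returns the joined rows and the number of `right` rows that survived
    the Bloom filter (hypothetical helper, for illustration).
    """
    bloom = BloomFilter()
    index = {}
    for row in left:
        bloom.add(row[left_key])
        index.setdefault(row[left_key], []).append(row)

    # Rows whose key cannot match are dropped before the actual join.
    survivors = [row for row in right if bloom.might_contain(row[right_key])]

    # The exact lookup removes any false positives the filter let through.
    joined = [
        {**l_row, **r_row}
        for r_row in survivors
        for l_row in index.get(r_row[right_key], [])
    ]
    return joined, len(survivors)
```

The filter pays off in the case the item describes: when most rows of the second table have no partner, most of them are rejected by a few bit tests and never reach the join. False positives only cost some extra work, since the exact key lookup still decides the final result.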

