hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/Roadmap" by TimArmstrong
Date Sat, 18 Jun 2011 20:10:19 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/Roadmap" page has been changed by TimArmstrong:

removed claim

  Priorities are denoted as P0 > P1 > ...
  === Query Optimization ===
-  * [P0] Optimizing JOIN followed by GROUP BY (claimed by Tim Armstrong) 
+  * [P0] Optimizing JOIN followed by GROUP BY
    * A lot of analytics queries are JOINs followed by GROUP BY (join keys and group by keys may or may not be the same or related). We need a better optimization for this kind of query (optimize number of MapReduce Jobs vs. optimize data transfer size etc.)
   * [P0] Optimize JOINs using Bloom Filters
    * This is to optimize the case where two big tables are joined but the results are small.
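
To illustrate the Bloom filter item above, here is a minimal sketch in Python of how such a pre-filter could shrink one side of a join before the exact match. It is not Hive's implementation and is not taken from the roadmap; the names BloomFilter, might_contain and bloom_join, and the sizing defaults, are hypothetical choices made only for this example.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter (illustrative sizing, not tuned)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(key)
        )


def bloom_join(left_rows, right_rows, key):
    """Inner join of two lists of dicts on `key`, pre-filtered by a Bloom filter."""
    # Build the filter from the join keys of one side.
    bloom = BloomFilter()
    for row in left_rows:
        bloom.add(row[key])

    # Drop rows from the other side that cannot match. In a distributed
    # setting this is the step that would reduce the data being shuffled.
    candidates = [row for row in right_rows if bloom.might_contain(row[key])]

    # Exact hash join on the survivors; this removes any false positives.
    index = {}
    for row in left_rows:
        index.setdefault(row[key], []).append(row)
    return [
        {**left, **right}
        for right in candidates
        for left in index.get(right[key], [])
    ]
```

Because a Bloom filter can report false positives but never false negatives, the pre-filter only discards rows that are certain not to join, and the exact join afterwards keeps the result correct. The benefit is largest in the case the roadmap describes: two large inputs whose join result is small.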
