pig-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Pig Wiki] Update of "PigMultiQueryPerformanceSpecification" by GuntherHagleitner
Date Wed, 20 May 2009 08:41:58 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Pig Wiki" for change notification.

The following page has been changed by GuntherHagleitner:
http://wiki.apache.org/pig/PigMultiQueryPerformanceSpecification

------------------------------------------------------------------------------
      as (user, action, timespent, query_term, ip_addr, timestamp,
          estimated_revenue, page_info, page_links);
  B = group A by user;
- C = foreach B generate A.user, MAX(A.estimated_revenue);
+ C = foreach B generate group, MAX(A.estimated_revenue);
  store C into 'highest_values';
  D = group A by query_term;
  E = foreach D generate group, SUM(A.timespent);
@@ -403, +403 @@

  
  Will be executed as:
  
- [TBD]
+ attachment:mapreduce-mapreduce.png
  
  If a split happens in a reduce plan, splittees have to be map-only jobs to be merged into
the splitter.
  If there are map-reduce splittees the reduce will result in a tmp store and the splittees
are run in separate

Mime
View raw message