hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Trivial Update of "HadoopMapReduce" by LarsFrancke
Date Mon, 09 Nov 2009 02:36:11 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "HadoopMapReduce" page has been changed by LarsFrancke.
The comment on this change is: linkfix.
http://wiki.apache.org/hadoop/HadoopMapReduce?action=diff&rev1=25&rev2=26

--------------------------------------------------

  When Mapper output is collected it is partitioned, which means
  that it will be written to the output specified by the
  [[http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapreduce/Partitioner.html|Partitioner]].
The default [[http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapreduce/lib/partition/HashPartitioner.html|HashPartitioner]]
uses the
- hashcode function on the key's class (which means that this hashcode function must be good
in order to achieve an even workload across the reduce tasks).  See [[http://svn.apache.org/viewcvs.cgi/hadoop/core/trunk/src/mapred/org/apache/hadoop/mapred/MapTask.java?view=markup|MapTask]]
for details.
+ hashcode function on the key's class (which means that this hashcode function must be good
in order to achieve an even workload across the reduce tasks).
+ See [[http://svn.apache.org/viewvc/hadoop/mapreduce/trunk/src/java/org/apache/hadoop/mapred/package.html?view=markup|MapTask]]
for details.
  
- N input files will generate M map tasks to be run and each map
+ ''N'' input files will generate ''M'' map tasks to be run and each map
  task will generate as many output files as there are reduce
  tasks configured in the system. Each output file will be
  targeted at a specific reduce task and the map output pairs from

Mime
View raw message