hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "HadoopMapReduce" by DanielTunkelang
Date Thu, 09 Jul 2009 03:56:48 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by DanielTunkelang:
http://wiki.apache.org/hadoop/HadoopMapReduce

The comment on the change is:
updated link to deprecated FileSplit class

------------------------------------------------------------------------------
  == Map ==
  
  As the Map operation is parallelized the input file set is first
- split to several pieces called [http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapred/FileSplit.html
FileSplits]. If an individual file
+ split to several pieces called [http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapreduce/lib/input/FileSplit.html
FileSplits]. If an individual file
  is so large that it will affect seek time it will be split to
  several Splits. The splitting does not know anything about the
  input file's internal logical structure, for example

Mime
View raw message