hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "ImportantConcepts" by TedDunning
Date Fri, 20 Jul 2007 03:13:04 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by TedDunning:

  Some notable terms that may confuse you:
  * Hadoop - Hadoop itself refers to the overall system that runs jobs, distributes tasks
(pieces of these jobs) and stores data in a parallel and distributed fashion.
+ * [:HadoopMapReduce:Map/reduce] - Is the style in which most programs running on Hadoop
are written.  In this style, input is broken in tiny pieces which are processed independently
(the map part).  The results of these independent processes are then collated into groups
and processed as groups (the reduce part).  Follow the link for a much more complete description.
  * Job -  In hadoop, the combination of all of the jars and classes needed to run a map/reduce
program is called a job.  All of these components are themselves collected into a jar which
is usually referred to as a job file.  To execute a job, you normally will use the command:

View raw message