hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "WordCount" by TedDunning
Date Fri, 20 Jul 2007 03:00:44 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by TedDunning:
http://wiki.apache.org/lucene-hadoop/WordCount

------------------------------------------------------------------------------
  To run the example, the command syntax is[[BR]]
  bin/hadoop jar hadoop-*-examples.jar wordcount [-m <#maps>] [-r <#reducers>]
<in-dir> <out-dir>
  
+ All of the files in the input directory (called in-dir in the command line above) are read
and the counts of words in the input are written to the output directory (called out-dir above).
 It is assumed that both inputs and outputs are stored in HDFS.  If your input is not already
in HDFS, but is rather in a local file system somewhere, you need to copy the data into HDFS
using a command like this:[[BR]]
+ bin/hadoop dfs -mkdir <hdfs-dir>
+ bin/hadoop dfs -copyFromLocal <local-dir> <hdfs-dir>
+ 
+ There is a bit of document on the hadoop dfs command at [[hadoop-0.1-dev/bin/hadoop dfs]],
but the descriptions of some of the commands appears to be incorrect or out of date.
+ 

Mime
View raw message