hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Lucene-hadoop Wiki] Update of "FAQ" by DougCutting
Date Fri, 08 Jun 2007 21:12:03 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by DougCutting:
http://wiki.apache.org/lucene-hadoop/FAQ

The comment on the change is:
update performance numbers and mention pipes

------------------------------------------------------------------------------
  
  == 3. How well does Hadoop scale? ==
  
- Hadoop has been demonstrated on clusters of up to 902 nodes.  Sort performance is [https://issues.apache.org/jira/secure/attachment/12356727/post-H-1270.png
good] (sort500 i.e. 5TB of data on 500 nodes takes around 2 1/2 hours) and still improving.
+ Hadoop has been demonstrated on clusters of up to 2000 nodes.  Sort performance is good
(sorting 9TB of data on 900 nodes takes around 4 hours) and [attachment:sort900-20070607.png
improving].
  
  == 4. Do I have to write my application in Java? ==
  
- No.  There are several ways to incorporate non-Java code.  HadoopStreaming permits any shell
command to be used as a map or reduce function, and Hadoop is also developing [http://svn.apache.org/viewvc/lucene/hadoop/trunk/src/c%2B%2B/
C and C++ APIs].
+ No.  There are several ways to incorporate non-Java code.  HadoopStreaming permits any shell
command to be used as a map or reduce function, and Hadoop is also developing [http://svn.apache.org/viewvc/lucene/hadoop/trunk/src/c%2B%2B/
C and C++ APIs] and a [http://www.swig.org/ SWIG]-compatible [http://issues.apache.org/jira/browse/HADOOP-234
pipes] API.
  
  == 5. How can I help to make Hadoop better? ==
  

Mime
View raw message