hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by LarsGeorge
Date Tue, 04 Nov 2008 08:46:28 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by LarsGeorge:

The comment on the change is:
Added description for WorldLingo

    * We use a small Hadoop cluster in the scope of our general research activities at [http://www.vklabs.com
VK Labs] to get a faster data access from web applications.
    * We also use Hadoop for filtering and indexing listing, processing log analysis, and
for recommendation data.  
+  * [http://www.worldlingo.com/ WorldLingo]
+   * Hardware: 44 servers (each server has: 2 dual core CPUs, 2TB storage, 8GB RAM)
+   * Each server runs Xen with one Hadoop/HBase instance and another instance with web or
application servers, giving us 88 usable virtual machines.
+   * We run two separate Hadoop/HBase clusters with 22 nodes each.
+   * Hadoop is primarily used to run HBase and Map/Reduce jobs scanning over the HBase tables
to perform specific tasks.
+   * HBase is used as a scalable and fast storage back end for millions of documents. 
+   * Currently we store 12million documents with a target of 450million in the near future.
   * [http://www.yahoo.com/ Yahoo!]
    * More than 100,000 CPUs in ~20,000 computers running Hadoop
    * Our biggest cluster: 2000 nodes (2*4cpu boxes w 4TB disk each)

View raw message