hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by KenKrugler
Date Thu, 05 Nov 2009 21:47:45 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "PoweredBy" page has been changed by KenKrugler.
http://wiki.apache.org/hadoop/PoweredBy?action=diff&rev1=159&rev2=160

--------------------------------------------------

    * We handle about 200TB per week
    * Our clusters vary from 10 to 500 nodes
    * Hypertable is also supported by Baidu
+ 
+  * [[http://bixolabs.com/|Bixo Labs]] - Elastic web mining
+   * The Bixolabs elastic web mining platform uses Hadoop + Cascading to quickly build scalable
web mining applications.
+   * We're doing a 200M page/5TB crawl as part of the [[http://bixolabs.com/datasets/public-terabyte-dataset-project/|public
terabyte dataset project]].
+   * This runs as a 20 machine [[http://aws.amazon.com/elasticmapreduce/|Elastic MapReduce]]
cluster.
  
   * [[http://www.cascading.org/|Cascading]] - Cascading is a feature rich API for defining
and executing complex and fault tolerant data processing workflows on a Hadoop cluster.
  

Mime
View raw message