hadoop-common-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by DhruvBansal
Date Wed, 05 May 2010 17:38:06 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "PoweredBy" page has been changed by DhruvBansal.
The comment on this change is: Refined description of how Infochimps uses Hadoop.
http://wiki.apache.org/hadoop/PoweredBy?action=diff&rev1=195&rev2=196

--------------------------------------------------

    * Used Hadoop and 18 nodes/52 cores to [[http://www.isi.edu/ant/address/whole_internet/|plot the entire internet]].
  
   * [[http://infochimps.org|Infochimps]]
-   * 30 nodes for processing big data such as Twitter data and MySpace data.
-   * EC2
+   * 30 node AWS EC2 cluster (varying instance size, currently EBS-backed) managed by Chef & Poolparty running Hadoop 0.20.2+228, Pig 0.5.0+30, Azkaban 0.04, [[http://github.com/infochimps/wukong|Wukong]]
+   * Used for ETL & data analysis on terascale datasets, especially social network data
  
   * [[http://www.iterend.com/|Iterend]]
    * using 10 node hdfs cluster to store and process retrieved data.
