hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by JoydeepSensarma
Date Thu, 21 Feb 2008 17:14:34 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by JoydeepSensarma:
http://wiki.apache.org/hadoop/PoweredBy

The comment on the change is:
add entry for facebook

------------------------------------------------------------------------------
    * Our clusters vary from 1 to 100 nodes.
  
   * [http://www.cascading.org/ Cascading] - A Java library to assist in creating and managing
complex MapReduce routines
+  * [http://www.facebook.com/ Facebook] 
+   * We use Hadoop to store copies of internal log and dimension data sources and use it
as a source for reporting/analytics and machine learning. 
+   * Currently have around a hundred machines - low end commodity boxes with about 1.5TB
of storage each. Our data sets are currently are of the order of 10s of TB and we routine
process multiple TBs of data everyday.
+   * We are heavy users of both streaming as well as the Java apis. We have built a higher
level data warehousing framework using these features (that we will open source at some point).
We have also written a read-only FUSE implementation over hdfs.
   * [http://www.hadoop.co.kr/ Hadoop Korean User Group]
    * 50 node cluster In the Korea university network environment. 
     * Pentium 4 PC, HDFS 4TB Storage

Mime
View raw message