hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by KevinWeil
Date Thu, 18 Mar 2010 20:42:23 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "PoweredBy" page has been changed by KevinWeil.
http://wiki.apache.org/hadoop/PoweredBy?action=diff&rev1=184&rev2=185

--------------------------------------------------

    * 6 node cluster with 96 total cores, 8GB RAM and 2 TB storage per machine.
  
   * [[http://www.twitter.com|Twitter]]
-   * We use hadoop to store and process tweets, log files, and many other types of data generated
across Twitter.  All data is stored as compressed LZO files.
+   * We use Hadoop to store and process tweets, log files, and many other types of data generated
across Twitter.  We use Cloudera's CDH2 distribution of Hadoop, and store all data as compressed
LZO files.
    * We use both Scala and Java to access Hadoop's MapReduce APIs
    * We use Pig heavily for both scheduled and ad-hoc jobs, due to its ability to accomplish
a lot with few statements.
    * We employ committers on Pig, Avro, Hive, and Cassandra, and contribute much of our internal
Hadoop work to opensource (see [[http://github.com/kevinweil/hadoop-lzo|hadoop-lzo]])

Mime
View raw message