hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "PoweredBy" by JeanBaptisteNote
Date Sat, 02 Aug 2014 12:52:52 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "PoweredBy" page has been changed by JeanBaptisteNote:
https://wiki.apache.org/hadoop/PoweredBy?action=diff&rev1=427&rev2=428

Comment:
Add Criteo's cluster to the lot

  
   * ''[[http://www.weblab.infosci.cornell.edu/|Cornell University Web Lab]] ''
    * ''Generating web graphs on 100 nodes (dual 2.4GHz Xeon Processor, 2 GB RAM, 72GB Hard
Drive) ''
+ 
+  * ''[[http://criteo.com|Criteo]] - Criteo is a global leader in online performance advertising
''
+   * ''[[http://labs.criteo.com/blog|Criteo R&D]] uses Hadoop as a consolidated platform
for storage, analytics and back-end processing, including Machine Learning algorithms ''
+   * ''We currently have a dedicated cluster of 850 nodes, 30PB storage, 65TB RAM, 16000
cores running full steam 24/7, and growing by the day ''
+   * ''Each node has 24 HT cores, 96GB RAM, 42TB HDD ''
+   * ''Hardware and platform management is done through [[http://www.getchef.com/|Chef]],
we run YARN ''
+   * ''We run a mix of ad-hoc Hive queries for BI, [[http://www.cascading.org/|Cascading]]
jobs, raw mapreduce jobs, and streaming [[http://www.mono-project.com/|Mono]] jobs, as well
as some Pig ''
  
   * ''[[http://www.crs4.it|CRS4]] ''
    * ''Hadoop deployed dynamically on subsets of a 400-node cluster ''

Mime
View raw message