hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hive/PoweredBy" by JeffHammerbacher
Date Sat, 14 Nov 2009 01:49:14 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hive/PoweredBy" page has been changed by JeffHammerbacher.
http://wiki.apache.org/hadoop/Hive/PoweredBy?action=diff&rev1=13&rev2=14

--------------------------------------------------

  We use Hadoop to store copies of internal log and dimension data sources and use it as a
source for reporting/analytics and machine learning.
  Currently have a 640 machine cluster with ~5000 cores and 2PB raw storage. Each (commodity)
node has 8 cores and 4 TB of storage.
  
+ *  [[http://www.grooveshark.com|Grooveshark]]
+ We use Hive for user analytics, dataset cleaning, and machine learning R&D.
+ 
  *  [[http://www.hi5.com|hi5]]
  We use Hive for analytics, machine learning and social graph analysis.
  
@@ -30, +33 @@

  *  [[http://www.trendingtopics.org|Trending Topics]]
  Hot Wikipedia Topics, Served Fresh Daily.  Powered by Cloudera Hadoop Distribution &
Hive on EC2.  We use Hive for log data normalization and building sample datasets for trend
detection R&D.
  
- *  [[http://www.grooveshark.com|Grooveshark]]
- We use Hive for user analytics, dataset cleaning, and machine learning R&D.
- 

Mime
View raw message