hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hbase/PoweredBy" by StevenNoels
Date Thu, 13 Dec 2012 07:26:02 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The "Hbase/PoweredBy" page has been changed by StevenNoels:

  [[http://www.kalooga.com|Kalooga]] is a discovery service for image galleries. We use Hadoop,
HBase and Pig on a 20-node cluster for our crawling, analysis and events processing.
- [[http://www.lilyproject.org|Lily]] is an open source content repository, backed by HBase
and SOLR from Outerthought - scalable content applications.
+ [[http://www.ngdata.com|NGDATA]] delivers [[http://www.lilyproject.org|Lily]], the consumer
intelligence solution that delivers a unique combination of  Big Data management, machine
learning technologies and consumer intelligence applications in one integrated solution to
allow better, and more dynamic, consumer insights. Lily allows companies to process and analyze
massive structured and unstructured data, scale storage elastically and locate actionable
data quickly from large data sources in near real time. 
  [[http://www.mahalo.com|Mahalo]], "...the world's first human-powered search engine". All
the markup that powers the wiki is stored in HBase. It's been in use for a few months now.
!MediaWiki - the same software that power Wikipedia - has version/revision control. Mahalo's
in-house editors produce a lot of revisions per day, which was not working well in a RDBMS.
An hbase-based solution for this was built and tested, and the data migrated out of MySQL
and into HBase. Right now it's at something like 6 million items in HBase. The upload tool
runs every hour from a shell script to back up that data, and on 6 nodes takes about 5-10
minutes to run - and does not slow down production at all.

View raw message