hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Busbey <bus...@cloudera.com>
Subject Re: What companies are using HBase to serve a customer-facing product?
Date Fri, 05 Dec 2014 22:01:16 GMT
Hi Jeremy!

We'll probably need more information to answer your questions.

In particular, what kind of read or write SLA are you looking to meet? At
what scale of concurrent users? What size of retrievals?

Normally, "customer facing application" means something in the human
interactive time scale, but how tight that bound needs to be varies widely
(e.g. 99% in < 1s, 99% in <500ms, 99% in < 5ms, 99% in < 1ms).

The combination of your latency needs and the expected concurrent workload
will probably end up driving your cluster needs more-so than the data set
size. (For reference, 60TB of raw data will probably fit in an HBase
cluster with only 1-8 worker nodes depending on hdd choice and compression.)

Your questions about # of deployments and supportability should then be
driven by the needed cluster size rather than data set size.

On Fri, Dec 5, 2014 at 3:37 PM, jeremy p <athomewithagroovebox@gmail.com>
wrote:

> Hey all,
>
> So, I'm currently evaluating HBase as a solution for querying a very large
> data set (think 60+ TB). We'd like to use it to directly power a
> customer-facing product. My question is threefold :
>
> 1) What companies use HBase to serve a customer-facing product? I'm not
> interested in evaluations, experiments, or POC.  I'm also not interested in
> offline BI or analytics.  I'm specifically interested in cases where HBase
> serves as the data store for a customer-facing product.
>
> 2) Of the companies that use HBase to serve a customer-facing product,
> which ones use it to query data sets of 60TB or more?
>
> 3) Of companies use HBase to query 60+ TB data sets and serve a
> customer-facing product, how many employees are required to support their
> HBase installation?  In other words, if I were to start a team tomorrow,
> and their purpose was to maintain a 60+ TB HBase installation for a
> customer-facing product, how many people should I hire?
>
> 4) Of companies use HBase to query 60+ TB data sets and serve a
> customer-facing product, what kind of measures do they take for disaster
> recovery?
>
> If you can, please point me to articles, videos, and other materials.
> Obviously, the larger the company, the better case it will make for HBase.
>
> Thank you!
>



-- 
Sean

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message