hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From AnandaVelMurugan Chandra Mohan <ananthu2...@gmail.com>
Subject Single table Vs Multiple table. Need inputs !!!
Date Wed, 13 Jun 2012 09:37:10 GMT

I am planning to use HBase DB for storing multiple reports of a component.
Each report has a unique set of metadata.

My query requirement is like this. I need to query and find out number of
reports of each component. Then I would have to filter those reports based
on some random metadata.

Initially I decided to use one table with multiple column families to
handle each report type. But HBase documentation instructs us to keep the
column families minimal. Also if a new report type gets added in future, I
will not be able to add new column family.

So I have two alternatives,

   - Have one column family for report metadata and fill the values in
   column based on report type. This will create sparse rows which I can
   - Create separate table for each report type and manage queries using

Any one have any better alternatives? Anyone see any performance
bottlenecks in these two alternatives. Please help me in deciding a durable
solutions. Thanks.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message