hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Yu <yuzhih...@gmail.com>
Subject Re: Rowkey design
Date Mon, 30 Nov 2015 03:22:34 GMT
bq. duplicate data to two different tables, one with (salt-productId-timestamp)
and other with (salt-productId-place) keys

I suggest think twice about the above schema. It may become tricky keeping
data in the two tables in sync.
Meaning, when update to table1 succeeds but update to table2 fails, you
need to take additional action either retrying write to table2 or rolling
back update to table1.


On Sun, Nov 29, 2015 at 2:19 PM, Marko Dinic <hacker.marko@gmail.com> wrote:

> Hello, everyone!
> I'm new to HBase and I need help designing rowkeys for use case that looks
> like this:
> - Products are listed, where each product has a product id.
> - Each product has a timestamp.
> - Each product is created in certain place (e.g. city)
> - Each product is created by some unit (e.g. factory)
> I would like to be able to scan products from a certain time period on one
> hand, from a certain place, or from a certain unit.
> I read about salting to avoid hot-spotting and I understand that rows are
> sequential by rowkey. This will allow me to scan for a certain time period
> using with following rowkey:
> salt-productId-timestamp
> And I can specify the period using STARTROW, ENDROW.
> What confuses me is how to include place (and maybe unit) into key and be
> able to select products from certain place during certain time period?
> If I limit myself to be able to scan by one of the above (time range OR
> place) I have an idea to duplicate data to two different tables, one with
> (salt-productId-timestamp) and other with (salt-productId-place) keys. Is
> that recommend or not?
> So, how to construct my keys?
> I should emphasize that i need this data to be input to MAPREDUCE JOB.
> Any help is greatly appreciated.
> --
> Best regards,
> Marko

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message