hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Leo Alekseyev <dnqu...@gmail.com>
Subject Best strategy for row updates
Date Mon, 01 Nov 2010 09:28:31 GMT
We are populating some HBase tables from daily data streams that are
stored in Hive.  When we see a row key that's already in the table,
the data should be appended to that row's record.  What is the best
way to achieve this?..  Should we be using the Java API?..  Rely on
HBase cell timestamping?..  Create compound keys (row_id+date) and
periodically run a separate MR job to coalesce all the data belonging
to the same row_id?..

Any pointers greatly appreciated!

--Leo

Mime
View raw message