hbase-user mailing list archives

From Felix Sprick <fspr...@gmail.com>
Subject Scans on salted rowkeys
Date Wed, 11 May 2011 09:21:46 GMT
Hi guys,

I am using rowkeys with a pattern like [minute]_[timestamp]. My main
use case is reading time ranges that span a couple of hours, and I want
to read in parallel from as many nodes in the cluster as possible, so
the minute prefix distributes the data in minute buckets across the
cluster.
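
To make the key layout concrete, here is a minimal sketch of how such a key could be built. It assumes that [minute] is the minute of the hour (00 to 59, UTC) and that [timestamp] is epoch milliseconds zero-padded to 13 digits so that string order matches time order; `make_rowkey` is a hypothetical helper name, not something from HBase.

```python
MS_PER_MINUTE = 60_000

def make_rowkey(ts_ms: int) -> str:
    """Build a key of the form MM_<13-digit epoch millis>.

    Assumption: the bucket prefix is the minute of the hour in UTC.
    Zero-padding keeps lexicographic order equal to numeric order
    within a bucket.
    """
    minute = (ts_ms // MS_PER_MINUTE) % 60
    return f"{minute:02d}_{ts_ms:013d}"

if __name__ == "__main__":
    # 01:10:00 after the epoch lands in bucket 10.
    print(make_rowkey(4_200_000))
```

With this layout, records from the same minute of different hours share a bucket and are sorted by time inside it.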

The problem is that I am not sure how to do sequential reads with this
layout (for example, all records between 11:10 and 12:00), or how to
define such time frames as input to my MapReduce jobs.
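
One possible way to think about it, sketched under assumptions rather than offered as a confirmed answer: a time range can be split into one row range per minute bucket, and each pair can then be used as the start and stop row of a separate scan. The sketch assumes keys of the form `MM_<13-digit epoch millis>` with a minute-of-hour (UTC) prefix, and an exclusive stop row as in an HBase `Scan`; `scan_ranges` is a hypothetical helper name.

```python
MS_PER_MINUTE = 60_000

def scan_ranges(start_ms: int, end_ms: int) -> list[tuple[str, str]]:
    """Return (start_row, stop_row) pairs, one per minute bucket touched.

    Covers timestamps in [start_ms, end_ms). The stop row is treated as
    exclusive, so a record stamped exactly end_ms is not included.
    """
    if end_ms <= start_ms:
        return []
    first_min = start_ms // MS_PER_MINUTE
    last_min = (end_ms - 1) // MS_PER_MINUTE
    # At most 60 distinct buckets exist, so cap the walk at one hour.
    last_min = min(last_min, first_min + 59)
    buckets = sorted({m % 60 for m in range(first_min, last_min + 1)})
    return [
        (f"{b:02d}_{start_ms:013d}", f"{b:02d}_{end_ms:013d}")
        for b in buckets
    ]

if __name__ == "__main__":
    # 11:10 to 12:00 on the first day after the epoch, as an example.
    for start_row, stop_row in scan_ranges(40_200_000, 43_200_000):
        print(start_row, stop_row)
```

A range shorter than an hour touches only some buckets, while anything of an hour or more touches all 60, so the number of scans never exceeds 60 regardless of how long the range is.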

Any ideas?

