hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Meil <doug.m...@explorysmedical.com>
Subject Re: HBase for ad-hoc aggregate queries
Date Wed, 11 Jan 2012 19:05:38 GMT

re:  "My understanding is MapReduce is more of a batch framework,"


re:  "and because the MR jobs write back to a new table."

They can write to where-ever they need to write (HDFS, Hbase, etc.)

Probably want to check out the Hbase Book/RefGuide on the Architecture,
DataModel, and MapReduce chapters.


On 1/11/12 1:59 PM, "kfarmer" <kfarmer@camstar.com> wrote:

>I'm taking a look at moving our datastore from Oracle to HBase, and
>trying to
>understand how HBase could be used for ad-hoc aggregation queries across
>My understanding is MapReduce is more of a batch framework, so if we want
>query to come back to the user's request in a few seconds, that won't work
>because of the overheard of running MR and because the MR jobs write back
>a new table.  Is that correct?
>Instead should we be pre-aggregating data as we load into separate tables,
>and then when a user queries instead just do a scan on these
>View this message in context:
>Sent from the HBase User mailing list archive at Nabble.com.

View raw message