hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael Hauck <michael.ha...@maxviva.com>
Subject Re: MapReduce newbie questions
Date Fri, 10 Jul 2009 09:10:57 GMT
Thanks for your help.

> First, I recommend upgrading to the latest HBase 0.19 release, 0.19.3.
Do i have to migrate/convert/import the data ?

> You have a few choices, but in short you want to use filters.
> http://hadoop.apache.org/hbase/docs/r0.19.3/api/org/apache/hadoop/hbase/filter/package-summary.html
> Specifically, you should look at the RegExpRowFilter:
> http://hadoop.apache.org/hbase/docs/r0.19.3/api/org/apache/hadoop/hbase/filter/RegExpRowFilter.html
Should be handy.

> You could set up the regular expression to only return stuff from the 
> month you want.  Inside the MR job you would know every row returned 
> would come from the month in question and would be able to look at the 
> key to determine the agency_id and day.
> There's an example in TIFB docs:
> http://hadoop.apache.org/hbase/docs/r0.19.3/api/org/apache/hadoop/hbase/mapred/TableInputFormatBase.html
The are the inputColumns in this example only the family names or do
they have to be fully qualified?

Michael Hauck <michael.hauck@maxviva.com>

View raw message