hbase-user mailing list archives

From Wei Tan <w...@us.ibm.com>
Subject RE: HBase MapReduce - Using mutiple tables as source
Date Mon, 06 Aug 2012 14:22:01 GMT
A related question: may I have multiple tables as output in a single
MapReduce job? I understand that this is achievable by running multiple MR
jobs, each with a different output table specified in the reduce class.
What I want is to scan a source table once and generate multiple tables in
that one pass.
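
To make the question concrete, here is a minimal sketch of the idea in plain
Java, with no HBase classes involved: each record produced during the single
scan is keyed by the name of its destination table, and the writer groups
records by that key. The class name, the table names and the routing rule
are all hypothetical. I believe HBase's MultiTableOutputFormat works along
these lines (key = table name, value = the Put), but please verify that
against the version you run.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Plain-Java model of a single-pass, multi-table write; no HBase API is used.
final class MultiTableRouter {
    // Hypothetical output table names, chosen only for illustration.
    static final String EVEN_TABLE = "even_rows";
    static final String ODD_TABLE = "odd_rows";

    /** Scans the source rows once and files each row under its destination table. */
    static Map<String, List<String>> route(List<String> sourceRows) {
        Map<String, List<String>> out = new LinkedHashMap<>();
        for (String row : sourceRows) {
            // Hypothetical routing rule: pick the table from the row key length.
            String table = (row.length() % 2 == 0) ? EVEN_TABLE : ODD_TABLE;
            out.computeIfAbsent(table, t -> new ArrayList<>()).add(row);
        }
        return out;
    }
}
```

Calling MultiTableRouter.route(...) on the rows of one scan yields one list
of rows per destination table, which is the shape a multi-table writer would
consume.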

Best Regards,

Wei Tan 
Research Staff Member 
IBM T. J. Watson Research Center
19 Skyline Dr, Hawthorne, NY  10532
wtan@us.ibm.com; 914-784-6752

From:   "Amlan Roy" <amlan.roy@cleartrip.com>
To:     <user@hbase.apache.org>, 
Date:   08/06/2012 09:05 AM
Subject:        RE: HBase MapReduce - Using mutiple tables as source


If TableMapper and TableMapReduceUtil.initTableMapperJob() do not support
multiple tables as input, can I use Hadoop Mapper/Reducer classes and
specify the input/output format myself?

What I want to do is read two tables in the map phase and reduce them
together. What is the best solution available in 0.92.0? (I understand the
proper solution is coming in version 0.96.0.)
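
A rough sketch of the "read two tables, reduce them together" logic, in
plain Java with no HBase or Hadoop classes: the map step tags every value
with the table it came from, and the reduce step sees all tagged values for
one row key together. The class name, the tags "A" and "B", and the output
format are assumptions made for illustration. The sketch does not show how
to feed both tables into one job on 0.92.0, which is the open part of the
question.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java model of a reduce-side join over two sources; no HBase API is used.
final class TaggedJoin {
    /** Map step stand-in: tag each (rowKey, value) pair with its source table. */
    static void mapTable(String tableTag, Map<String, String> rows,
                         Map<String, List<String>> shuffle) {
        for (Map.Entry<String, String> e : rows.entrySet()) {
            shuffle.computeIfAbsent(e.getKey(), k -> new ArrayList<>())
                   .add(tableTag + ":" + e.getValue());
        }
    }

    /** Reduce step stand-in: combine all tagged values that share a row key. */
    static Map<String, String> join(Map<String, String> tableA,
                                    Map<String, String> tableB) {
        Map<String, List<String>> shuffle = new TreeMap<>();
        mapTable("A", tableA, shuffle);
        mapTable("B", tableB, shuffle);
        Map<String, String> joined = new TreeMap<>();
        for (Map.Entry<String, List<String>> e : shuffle.entrySet()) {
            joined.put(e.getKey(), String.join(",", e.getValue()));
        }
        return joined;
    }
}
```

Because every value carries its source tag, the reduce step can tell which
table each value came from even though both tables share one key space.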


-----Original Message-----
From: Ioakim Perros [mailto:imperros@gmail.com] 
Sent: Monday, August 06, 2012 5:11 PM
To: user@hbase.apache.org
Subject: Re: HBase MapReduce - Using mutiple tables as source


Isn't it the case that you can always open a scanner inside a map task, on
a table other than the one that was passed to
TableMapReduceUtil.initTableMapperJob(...)?

Hope this serves as a temporary solution.

On 08/06/2012 02:35 PM, Mohammad Tariq wrote:
> Hello Amlan,
>      Issue is still unresolved...Will get fixed in 0.96.0.
> Regards,
>      Mohammad Tariq
> On Mon, Aug 6, 2012 at 5:01 PM, Amlan Roy <amlan.roy@cleartrip.com> wrote:
>> Hi,
>> While writing a MapReduce job for HBase, can I use multiple tables as
>> input? I think TableMapReduceUtil.initTableMapperJob() takes a single
>> table as parameter. For my requirement, I want to specify multiple
>> tables and scan instances. I read about MultiTableInputCollection in the
>> document https://issues.apache.org/jira/browse/HBASE-3996, but I don't
>> find it in HBase-0.92.0.
>> Regards,
>> Amlan
