hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From James Kennedy <james.kenn...@troove.net>
Subject HBase HClient - Can we make it multi-table?
Date Mon, 25 Jun 2007 18:02:31 GMT
I have an app that needs to access multiple HBase tables concurrently.  
The current HClient can only have one table open at a time even though 
it caches region servers of multiple tables as they are looked up.

This means that my application layer must open multiple HClients, one 
per table, perhaps caching those HClients in a pool to reuse them (and 
their cached table data) as appropriate.


Shall I write an HClient patch that makes the HClient  multi-table 

Essentially this would mean changing the interface to add table name 
parameters to methods like put(), commit(), startUpdate() etc, maintain 
a list of opened tables (as opposed to the single tableServers field), 
and ensure that it is thread-safe.  I would also keep for each table a  
<row, lockid and currentServer and currentRegion> map for startUpdate() 
calls so that the the different rows of the same table could be updated 
concurrently.  This would also make it possible to  check for the rare 
case of clientid = rand.nextLong() collisions, if that's deemed necessary.

Alternatively, perhaps what we need is something like an 
HClientManager/Factory which manages table info but  gives out 
single-table HClients on request to be used for one-off transactions.  
This keeps the HClient simpler while keeping the cached regionInfo shared. 


View raw message