accumulo-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Donald Miner <>
Subject Accumulo / HBase migration
Date Tue, 09 Jul 2013 17:26:38 GMT
Has anyone developed tools to migrate data from an existing HBase
implementation to Accumulo? My team has done it "manually" in the past but
it seems like it would be reasonable to write a process that handled the
steps in a more automated fashion.

Here are a few sample designs I've kicked around:

HBase -> mapreduce -> mappers bulk write to accumulo -> Accumulo
HBase -> mapreduce -> tfiles via AccumuloFileOutputFormat -> Accumulo bulk
load -> Accumulo
HBase -> bulk export -> map-only mapreduce to translate hfiles into tfiles
(how hard would this be??) -> Accumulo bulk load -> Accumulo

I guess this could be extended to go the other way around (and also include
Cassandra perhaps).

Maybe we'll start working on this soon. I just wanted to kick the idea out
there to see if it's been done before or if anyone has some gut reactions
to the process.


 This communication is the property of ClearEdge IT Solutions, LLC and may 
contain confidential and/or privileged information. Any review, 
retransmissions, dissemination or other use of or taking of any action in 
reliance upon this information by persons or entities other than the 
intended recipient is prohibited. If you receive this communication in 
error, please immediately notify the sender and destroy all copies of the 
communication and any attachments.

View raw message