hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From prem yadav <ipremya...@gmail.com>
Subject Re: Loading data from S3
Date Wed, 08 Aug 2012 13:52:23 GMT
I have used the tool Hbackup from https://github.com/urbanairship/hbackup

I will look into S3distcp. The name suggests ot should be sufficient for me
to load the data.
However I have a more generic question. How do people who backup the Hbase
data tables to S3 test the restore.

My backup ran for about a day and there were a couple of exceptions in the
logs. How do I test the table? Do I need to recreate the hadoop/Hbase
cluster and test whether everything went well?

On Wed, Aug 8, 2012 at 6:54 PM, Dan Young <danoyoung@gmail.com> wrote:

> Have you looked into s3distcp ?
> Regards ,
> Dano
> On Aug 8, 2012 7:21 AM, "prem yadav" <ipremyadav@gmail.com> wrote:
>> Hi,
>> I recently used a backup tool to back up all my HDFS data to S3. The data
>> is on S3 in multiparts.
>> I need to test the restore now. Could you please give me some pointers on
>> how to test this.
>> 1) Do I need to create another cluster? The data is around 3 TB in size.
>> 2) How do I upload multipart data from S3 to HDFS cluster?
>> regards,
>> Prem

View raw message