hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pablo Musa <pa...@psafe.com>
Subject HDFS Backup for Hadoop Update
Date Tue, 26 Feb 2013 22:39:14 GMT
Hello guys,
I am starting the update from hadoop 0.20 to a newer version which changes
HDFS format(2.0). I read a lot of tutorials and they say that data loss is
possible (as expected). In order to avoid HDFS data loss I am will probably
backup all HDFS structure (7TB per node). However, this is a huge amount
of data and it will take a lot of time in which my service would be 

I was thinking about a simple approach: copying all files to a different 
I tried to find some parallel files compactor to fasten the process, but 
not find it.

How do you guys did it?
Is there some trick?

Thank you in advance,
Pablo Musa

View raw message