hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Olivier Renault <orena...@hortonworks.com>
Subject Re: Multi node maintenance for HDFS?
Date Thu, 16 Jun 2016 17:21:09 GMT
Hi Stephan,

It happens that I’ve been working on this during the last two days. It was much easier that
I was expecting.


Let me know if you’ve got any questions

Kind regards,

From: Stephan Hoermann <srhoermann@gmail.com>
Date: Thursday, 16 June 2016 at 04:29
To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Subject: Multi node maintenance for HDFS?


How do people do multi node maintenance for HDFS without data loss?

We want to apply the ideas of immutable infrastructure to how we manage our machines. We prebuild
an OS image with the configuration and roll it out to our nodes. When we have a patch we build
a new image and roll that out again. It takes us about 10 to 15 minutes to do that.

For our data nodes we want to keep the data on a separate partition/disks so that when we
rebuild we rejoin HDFS with the data don't start a replication storm.

Now in order to scale this and quickly roll out upgrades we can't really do a one node at
a time upgrade so we need to be able to take out a percentage of the nodes at a time. Ideally
we would like to do this while keeping the replication count of each block at 2 (so we can
still handle failure while we are doing an upgrade) and without starting a replication strategy.

Right now it doesn't look like that is really supported. Is anyone else doing multi node upgrades
and how do you solve these problems?

We are considering changing the replication strategy so that we divide all our nodes into
3 evenly sized buckets and at maintenance remove a subset from one bucket at a time. Does
anyone have experience with doing something similar?


View raw message