hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mohammad Tariq <donta...@gmail.com>
Subject Re: Hadoop hardware failure recovery
Date Fri, 10 Aug 2012 18:43:35 GMT
Hello Aji,

   Hadoop's redundancy feature allows data to be replicated over the
entire cluster. So, even if entire disk is gone or even the entire
machine for that matter, your data is still there in other node(s).
But, we need to keep one thing in mind that the 'master' node is the
single point of failure in a Hadoop cluster. If the machine running
master process(es) is down, you are trapped. For more detail you can
visit the home page at : redundancy feature

    Mohammad Tariq

On Sat, Aug 11, 2012 at 12:08 AM, Aji Janis <aji1705@gmail.com> wrote:
> I am very new to Hadoop. I am considering setting up a Hadoop cluster
> consisting of 5 nodes where each node has 3 internal hard drives. I
> understand HDFS has a configurable redundancy feature but what happens if an
> entire drive crashes (physically) for whatever reason? How does Hadoop
> recover, if it can, from this situation? What else should I know before
> setting up my cluster this way? Thanks in advance.

View raw message