Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3FB1317389 for ; Sun, 5 Apr 2015 05:39:37 +0000 (UTC) Received: (qmail 71617 invoked by uid 500); 5 Apr 2015 05:39:30 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 71475 invoked by uid 500); 5 Apr 2015 05:39:30 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 71465 invoked by uid 99); 5 Apr 2015 05:39:29 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Apr 2015 05:39:29 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=FREEMAIL_REPLY,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of arthur.hk.chan@gmail.com designates 209.85.214.171 as permitted sender) Received: from [209.85.214.171] (HELO mail-ob0-f171.google.com) (209.85.214.171) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 05 Apr 2015 05:39:03 +0000 Received: by obvd1 with SMTP id d1so6351528obv.0 for ; Sat, 04 Apr 2015 22:39:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=c8xDNgNiQIQBm5ETqT20b7BG6HwkOohNz2TbFRD+GM0=; b=EfW8Bs5wa6H+Q4O/EGJJaKo20do+AnJmP+G0RQU9IUNtfbo5rx/LU07c9ctz/pdBSL auL/jyLwtauIKFoELbhEVjaNg0x/2VrmgM/1P3XjM+tsaKroihcijNI5p2nqzIjz7tqB LQWJDv0/u1zbT2JB4jJ6cGcv0JQ8Hg9IMryPe7dX59Cf/CN0UNwucoYvPZ6m4cE+ET1U YuN/810U/kVNcOn/8pYBviBdnstSXGzFO0HGWdHSPB6VoZ8DOWHLdice4gHrOoaFnwYv JpAxpcH8ONEfZ/RNb453jxtUkS5CV63iCJ195gZW6t+hkpInhCZrVmntR0ZY3LAuru60 UOyQ== MIME-Version: 1.0 X-Received: by 10.182.241.99 with SMTP id wh3mr11646376obc.81.1428212342016; Sat, 04 Apr 2015 22:39:02 -0700 (PDT) Received: by 10.202.173.10 with HTTP; Sat, 4 Apr 2015 22:39:01 -0700 (PDT) In-Reply-To: References: Date: Sun, 5 Apr 2015 13:39:01 +0800 Message-ID: Subject: Re: How will Hadoop handle it when a datanode server with total hardware failure? From: Arthur Chan To: "user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=001a11c2ea344f4e750512f3999e X-Virus-Checked: Checked by ClamAV on apache.org --001a11c2ea344f4e750512f3999e Content-Type: text/plain; charset=UTF-8 Hi, I use the default replication factor 3 here, the cluster has 10 nodes, each of my datanode has 8 hard disks. If one of the nodes is down because of hardware failure, i.e. the 8 hard disks will no longer be available immediately during the down time of this machine, does it mean that I will have data lost? (8 hard disks > 3 replicated) Or what would be the maximum number of servers that are allowed to be down without data lost here? Regards Arthur On Wednesday, December 17, 2014, Harshit Mathur wrote: > Hi Arthur, > > In HDFS there will be block level replication, In case of total failure of > a datanode the lost blocks will get under replicated hence the namenode > will create copy of these under replicated blocks on some other datanode. > > BR, > Harshit > > On Wed, Dec 17, 2014 at 11:35 AM, Arthur.hk.chan@gmail.com > < > arthur.hk.chan@gmail.com > > wrote: >> >> Hi, >> >> If each of my datanode servers has 8 hard disks (a 10-node cluster) and >> I use the default replication factor of 3, how will Hadoop handle it when a >> datanode with total hardware failure suddenly? >> >> Regards >> Arthur >> > > > > -- > Harshit Mathur > --001a11c2ea344f4e750512f3999e Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi,=C2=A0

I use the default replication factor 3 here,= the cluster has 10 nodes, each of my datanode has 8 hard disks.=C2=A0 If o= ne of the nodes is down because of hardware failure, i.e. the 8 hard disks = will no longer be available immediately during the down time of this machin= e, does it mean that I will have data lost? (8 hard disks > =C2=A03 repl= icated)

Or what would be the maximum numb= er of servers that are allowed to be down without data lost here?=C2=A0

Regards
Arthur

On Wednesday, December 17, 2014, Harsh= it Mathur <mathursharp@gmail.co= m> wrote:
Hi= Arthur,

In HDFS there will be block level replication, I= n case of total failure of a datanode the lost blocks will get under replic= ated hence the namenode will create copy of these under replicated blocks o= n some other datanode.

BR,
Harshit

On Wed, De= c 17, 2014 at 11:35 AM, Arthur.hk.chan@gmail.co= m <arthur.hk.chan@gmai= l.com> wrote:
Hi,

If each of=C2=A0 my datanode servers has 8 hard disks (a 10-node cluster) a= nd I use the default replication factor of 3, how will Hadoop handle it whe= n a datanode with total hardware failure suddenly?

Regards
Arthur
=C2=A0


--
Ha= rshit Mathur
--001a11c2ea344f4e750512f3999e--