From: "Kartashov, Andy" <Andy.Kartashov@mpac.ca>
To: user@hadoop.apache.org
Subject: RE: a question on NameNode
Date: Mon, 19 Nov 2012 15:14:44 +0000

Thank you, Kai. One more question, please.

 

Does MapReduce run tasks on redundant (replicated) blocks?

 

Say you have only 1 block of data, replicated 3 times, with one copy on each of three DataNodes: block 1 on DN1, replica #1 on DN2, and replica #2 on DN3.

 

Will MR attempt:

 

a.  to start 3 Map tasks (one per replica) and execute them all

b.  to start 3 Map tasks (one per replica) and drop the other two as soon as one of the three completes successfully

c.  to start only 1 Map task (for just one replica, ignoring the others) and attempt to start another task (on one of the other replicas) if and only if the initially running task (say, on DN1) fails (see the sketch below)
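
For what it's worth, a quick way to check is to ask the InputFormat for its splits: as far as I know, FileInputFormat creates one split (and hence one map task) per HDFS block, not per replica, and the hosts of all replicas are attached to that single split only as locality hints; if an attempt fails, the framework reschedules another attempt of the same task, preferably on a node holding another replica. A minimal sketch, assuming a Hadoop 2.x-era client on the classpath and a hypothetical input path /user/andy/A.txt:

    import java.util.Arrays;
    import java.util.List;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.InputSplit;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;

    public class SplitInspector {
        public static void main(String[] args) throws Exception {
            Job job = Job.getInstance(new Configuration());
            // Hypothetical path; replace with a real file in your cluster.
            FileInputFormat.addInputPath(job, new Path("/user/andy/A.txt"));
            // One InputSplit (and so one map task) is created per block,
            // regardless of how many replicas of that block exist.
            List<InputSplit> splits = new TextInputFormat().getSplits(job);
            for (InputSplit split : splits) {
                // getLocations() lists the DataNodes holding replicas of the
                // block -- these are scheduling hints, not extra tasks.
                System.out.println(split + " on " + Arrays.toString(split.getLocations()));
            }
        }
    }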

 

Thanks,

 

From: Kai Voigt [mailto:k@123.org]
Sent: Monday, November 19, 2012 10:01 AM
To: user@hadoop.apache.org
Subject: Re: a question on NameNode

 

 

On 19.11.2012 at 15:43, "Kartashov, Andy" <Andy.Kartashov@mpac.ca> wrote:



So, what if DN2 is down, i.e. it is not sending any block reports? Then the NN (I guess) will figure out that it has 2 blocks (3 and 4) that have no home and that (without replication) it has no way of reconstructing the file A.txt. It must report an error then.
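
A minimal sketch of how to inspect this, assuming a reachable cluster and the same hypothetical path /user/andy/A.txt, is to ask the NameNode which DataNodes it currently knows for each block of the file; its answer comes from the block map it builds out of the DataNodes' block reports:

    import java.util.Arrays;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class BlockLocations {
        public static void main(String[] args) throws Exception {
            // Uses whatever core-site.xml/hdfs-site.xml is on the classpath.
            FileSystem fs = FileSystem.get(new Configuration());
            Path file = new Path("/user/andy/A.txt");   // hypothetical path
            FileStatus status = fs.getFileStatus(file);
            // The NameNode answers from its in-memory block map, which is
            // populated by the DataNodes' block reports.
            BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());
            for (int i = 0; i < blocks.length; i++) {
                System.out.println("block " + i + " -> " + Arrays.toString(blocks[i].getHosts()));
            }
        }
    }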

 

One major feature of HDFS is its redundancy. Blocks are stored more than once (three times by default), so chances are good that another DataNode will have that block and report it during the safe mode phase. So the file will be accessible.
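
A minimal sketch of checking (and raising) a file's replication factor through the FileSystem API, again with a hypothetical path; note that dfs.replication (default 3) only sets the default for newly written files, while setReplication() changes the target for an existing file:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class ReplicationCheck {
        public static void main(String[] args) throws Exception {
            FileSystem fs = FileSystem.get(new Configuration());
            Path file = new Path("/user/andy/A.txt");   // hypothetical path
            short current = fs.getFileStatus(file).getReplication();
            System.out.println(file + " is currently replicated " + current + "x");
            // Ask the NameNode to keep one more copy; re-replication
            // happens in the background.
            fs.setReplication(file, (short) (current + 1));
        }
    }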

 

Kai

 

-- 

Kai Voigt

 



 

NOTICE: This e-mail message and any attachments are confidential, subject to copyright and may be privileged. Any unauthorized use, copying or disclosure is prohibited. If you are not the intended recipient, please delete and contact the sender immediately. Please consider the environment before printing this e-mail.