From: "Sourygna Luangsay" <sluangsay@pragsis.com>
To: user@hadoop.apache.org
Subject: is HDFS RAID "data locality" efficient?
Date: Wed, 8 Aug 2012 18:46:03 +0200

Hi folks!

I have just read about the HDFS RAID feature that was added to Hadoop 0.21 or 0.22, and I am quite curious to know whether people use it, what kind of use they make of it, and what they think about Map/Reduce data locality.

The first big actor behind this technology is Facebook, which claims to save many PB with it (see http://www.slideshare.net/ydn/hdfs-raid-facebook, slides 4 and 5).

I understand the following advantages of HDFS RAID:
- You save space
- The system tolerates more missing blocks

Nonetheless, one of the drawbacks I see is M/R data locality.

As far as I understand, the advantage of having 3 replicas of each block is not only safety if one server fails or a block is corrupted, but also the possibility of having up to 3 tasktrackers execute the map task with "local data". If you consider the 4th slide of the Facebook presentation, such an infrastructure reduces this possibility to only 1 tasktracker.
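To make the space-saving claim concrete, here is a rough back-of-the-envelope sketch. The stripe lengths and replication settings below are my assumptions based on the commonly cited HDFS RAID configurations (XOR with one parity block per 10-block stripe, and Reed-Solomon (10,4)), not figures taken from the slides:

```python
# Illustrative storage-overhead comparison for HDFS RAID.
# All parameters here are assumptions, not confirmed Facebook settings.

def effective_replication(data_rep, parity_blocks, stripe_len, parity_rep):
    """Bytes stored per byte of user data for a RAIDed stripe:
    data replication plus the amortized cost of the parity blocks."""
    return data_rep + parity_rep * parity_blocks / stripe_len

plain = 3.0                                      # default HDFS replication
xor   = effective_replication(2, 1, 10, 2)       # XOR parity, data kept at 2x
rs    = effective_replication(1, 4, 10, 1)       # Reed-Solomon (10,4), data at 1x

print(f"3x replication: {plain:.1f}x")           # 3.0x
print(f"XOR RAID:       {xor:.1f}x")             # 2.2x
print(f"RS (10,4):      {rs:.1f}x")              # 1.4x
```

Under these assumptions, going from 3.0x to 1.4x is where the multi-PB savings would come from, but it also means most blocks end up with a single replica, which is exactly the locality concern below.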
That means that if this tasktracker is very busy executing other tasks, you have the following choice:
- Wait for this tasktracker to finish executing (part of) its current tasks (freeing map slots, for instance)
- Execute the map task for this block on another tasktracker, transferring the block's data over the network

In both cases, you'll pay an M/R penalty (please tell me if I am wrong).

Has somebody considered such a penalty, or does anyone have benchmarks to share with us?

One scenario I can think of in order to take advantage of HDFS RAID without suffering this penalty is:
- Use normal HDFS with the default replication=3 for my "fresh" data
- Use HDFS RAID for my historical data (which is barely used by M/R)

And you, what are you using HDFS RAID for?

Regards,

Sourygna Luangsay
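The locality argument above can be quantified with a toy model. Assume each node holding a replica is independently "too busy for a local map" with some probability p; then the chance of finding at least one node that can run the task locally is 1 - p^r for r replicas. This is my own simplification (real schedulers also consider rack locality and delay scheduling), not something from the slides:

```python
# Toy model: probability that at least one of the r nodes holding a
# replica has a free map slot, if each node is busy with probability p.
# Independence between nodes is an assumption made for simplicity.

def p_local_map(replicas, p_node_busy):
    return 1 - p_node_busy ** replicas

for r in (3, 1):  # 3 replicas (plain HDFS) vs 1 replica (after RAID)
    print(f"replicas={r}: P(node-local map) = {p_local_map(r, 0.5)}")
```

With p = 0.5, three replicas give a 0.875 chance of a node-local map, while a single replica gives only 0.5, so roughly half of those maps would either wait or read the block over the network, which is the penalty I am asking about.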