Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
From: "Joseph Naegele" <jnaegele@grierforensics.com>
To: <user@hadoop.apache.org>
Subject: HDFS behavior and dfsUsed file
Date: Sun, 13 Mar 2016 17:46:43 -0400
Message-ID: <03c701d17d71$d5d11180$81733480$@grierforensics.com>
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_NextPart_000_03C8_01D17D50.4EC0F820"
Thread-Index: AdF9cCE+LYR1Xg71S3+aNlQ9ixLXyA==
Content-Language: en-us

------=_NextPart_000_03C8_01D17D50.4EC0F820
Content-Type: text/plain;
	charset="us-ascii"
Content-Transfer-Encoding: 7bit

I believe I've encountered some curious HDFS behavior using Hadoop 2.7.1.
Unfortunately I'm in a situation where I need to manually migrate the
contents of two volumes used by HDFS to a new volume, on each node. After
doing so there are a few file conflicts coming from the two original
volumes, specifically the top-level VERSION file, scanner.cursor file, and
"dfsUsed" file. If the dfsUsed file is deleted, when restarting the cluster
the blocks on each DataNode are erased completely and a new dfsUsed file is
generated, this time showing that the volume is nearly empty.

 
I understand that the "dfsUsed" file is an important piece of metadata for
HDFS, but I would expect that if the file disappeared (a very rare corner
case, I admit), that HDFS could just regenerate it by verifying the blocks
on disk against what is expected by the NameNode. More importantly, I
wouldn't expect HDFS to actually delete valid blocks from disk just because
that one text file went missing. Immediately after starting the cluster I
ran "hdfs fsck /" and it reported that every block was missing and therefore
corrupt. Prior to running "start-dfs.sh" and the "fsck" I had successfully
copied ~32 TB, then immediately afterward all 32 TB of blocks across 10
nodes disappeared from each node's filesystem.

 
Is this expected behavior?

 
If I'm going to *manually* migrate blocks from two source volumes to a new
destination volume, is there a "safe" way to do it? (e.g. generate a new,
valid "dfsUsed" file by hand? What about the VERSION files, which contain
unique storageIDs?)

 
Should I ask about this on the developer list?

 
Thanks,

Joe Naegele


------=_NextPart_000_03C8_01D17D50.4EC0F820
Content-Type: text/html;
	charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40"><head><META =
HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Dus-ascii"><meta name=3DGenerator content=3D"Microsoft Word 15 =
(filtered medium)"><style><!--
/* Font Definitions */
@font-face
	{font-family:"Cambria Math";
	panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
	{font-family:Calibri;
	panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
	{mso-style-priority:99;
	color:#0563C1;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{mso-style-priority:99;
	color:#954F72;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:"Calibri",sans-serif;
	color:windowtext;}
.MsoChpDefault
	{mso-style-type:export-only;
	font-family:"Calibri",sans-serif;}
@page WordSection1
	{size:8.5in 11.0in;
	margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
	{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext=3D"edit">
<o:idmap v:ext=3D"edit" data=3D"1" />
</o:shapelayout></xml><![endif]--></head><body lang=3DEN-US =
link=3D"#0563C1" vlink=3D"#954F72"><div class=3DWordSection1><p =
class=3DMsoNormal>I believe I've encountered some curious HDFS behavior =
using Hadoop 2.7.1. Unfortunately I'm in a situation where I need to =
manually migrate the contents of two volumes used by HDFS to a new =
volume, on each node. After doing so there are a few file conflicts =
coming from the two original volumes, specifically the top-level VERSION =
file, scanner.cursor file, and &quot;dfsUsed&quot; file. If the dfsUsed =
file is deleted, when restarting the cluster the blocks on each DataNode =
are erased completely and a new dfsUsed file is generated, this time =
showing that the volume is nearly empty.<o:p></o:p></p><p =
class=3DMsoNormal><o:p>&nbsp;</o:p></p><p class=3DMsoNormal>I understand =
that the &quot;dfsUsed&quot; file is an important piece of metadata for =
HDFS, but I would expect that if the file disappeared (a very rare =
corner case, I admit), that HDFS could just regenerate it by verifying =
the blocks on disk against what is expected by the NameNode. More =
importantly, I wouldn't expect HDFS to actually delete valid blocks from =
disk just because that one text file went missing. Immediately after =
starting the cluster I ran &quot;hdfs fsck /&quot; and it reported that =
every block was missing and therefore corrupt. Prior to running =
&quot;start-dfs.sh&quot; and the &quot;fsck&quot; I had successfully =
copied ~32 TB, then immediately afterward all 32 TB of blocks across 10 =
nodes disappeared from each node's filesystem.<o:p></o:p></p><p =
class=3DMsoNormal><o:p>&nbsp;</o:p></p><p class=3DMsoNormal>Is this =
expected behavior?<o:p></o:p></p><p =
class=3DMsoNormal><o:p>&nbsp;</o:p></p><p class=3DMsoNormal>If I'm going =
to *manually* migrate blocks from two source volumes to a new =
destination volume, is there a &quot;safe&quot; way to do it? (e.g. =
generate a new, valid &quot;dfsUsed&quot; file by hand? What about the =
VERSION files, which contain unique storageIDs?)<o:p></o:p></p><p =
class=3DMsoNormal><o:p>&nbsp;</o:p></p><p class=3DMsoNormal>Should I ask =
about this on the developer list?<o:p></o:p></p><p =
class=3DMsoNormal><o:p>&nbsp;</o:p></p><p =
class=3DMsoNormal>Thanks,<o:p></o:p></p><p class=3DMsoNormal>Joe =
Naegele<o:p></o:p></p></div></body></html>
------=_NextPart_000_03C8_01D17D50.4EC0F820--