From: Uma Maheswara Rao G <maheswara@huawei.com>
To: user@hadoop.apache.org
Subject: RE: How to do HADOOP RECOVERY ???
Date: Mon, 29 Oct 2012 11:40:22 +0000

I am not sure I understood your scenario correctly; here is one possibility, based on what you have described.

 

>> I have saved the dfs.name.dir separately, and started with fresh cluster...
When you started the fresh cluster, did you use the same DNs? If so, the blocks will have been invalidated, because your namespace is now fresh (in fact a DN cannot register until you clean its data directories, since the namespace ID differs).
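
To double-check the namespace ID mismatch, you can compare the VERSION files on the NN and DN sides. A minimal sketch, assuming dfs.name.dir is /data/dfs/name and dfs.data.dir is /data/dfs/data (adjust both paths to your configuration):

# NameNode side (path is an assumption; use your dfs.name.dir)
grep namespaceID /data/dfs/name/current/VERSION
# DataNode side (path is an assumption; use your dfs.data.dir)
grep namespaceID /data/dfs/data/current/VERSION
# If the two namespaceID values differ, the DN cannot register with this NN.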

Now you are putting the older image back and starting again. The older image will expect enough of its blocks to be reported from the DNs before it can start normally; otherwise it will stay in safe mode. How is it coming out of safe mode?
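
You can check the safe mode state directly with dfsadmin; these are standard commands in 0.20.x:

hadoop dfsadmin -safemode get    # prints "Safe mode is ON" or "... OFF"
hadoop dfsadmin -safemode wait   # blocks until the NN leaves safe mode
# Note: "hadoop dfsadmin -safemode leave" forces the NN out of safe mode,
# but it does not bring the missing blocks back.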

 

Or did you continue with the same cluster, additionally save the namespace separately as a backup of the current state, and then add an extra DN to the cluster, referring to that as the fresh cluster?

In this case, if you delete any existing files, the corresponding data blocks will be invalidated on the DNs.

After that, if you go back to the older cluster with the backed-up namespace, the older image will not know about those deletions; it will still expect the blocks to be reported, and any file whose blocks are unavailable will be treated as corrupt.
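
You can enumerate exactly which files are affected with fsck; a sketch (the path here is only an illustration):

hadoop fsck /user/hive/warehouse -files -blocks -locations
# Files whose blocks have no live replicas are reported as CORRUPT/MISSING.

As a last resort, "hadoop fsck / -move" moves the corrupt files to /lost+found and "hadoop fsck / -delete" removes them, but neither option recovers the lost data.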

>> I did -ls / operation and got this exception


>> mediaadmins-iMac-2:haadoop-0.20.2 mediaadmin$ HADOOP dfs -ls /user/hive/warehouse/vw_cc/
>>Found 1 items

ls will show the file because the namespace still has its metadata, but the DNs do not have any blocks for it.
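
If the old DN disks are still intact, it is worth checking whether the block files physically survive; a sketch, assuming dfs.data.dir is /data/dfs/data:

# Block ID taken from the client log you pasted below
find /data/dfs/data -name 'blk_-1280621588594166706*'
# If the blk_* file and its .meta file are still on disk, and the DN's
# namespaceID matches the restored image, restarting that DN should let it
# report the block back to the NN again.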

From: yogesh.kumar13@wipro.com [yogesh.kumar13@wipro.com]
Sent: Monday, October 29, 2012 4:13 PM
To: user@hadoop.apache.org
Subject: RE: How to do HADOOP RECOVERY ???

Thanks Uma,

I am using hadoop-0.20.2 version.

UI shows.

Cluster Summary

379 files and directories, 270 blocks = 649 total. Heap Size is 81.06 MB / 991.69 MB (8%)

WARNING: There are about 270 missing blocks. Please check the log or run fsck.

Configured Capacity : 465.44 GB
DFS Used : 20 KB
Non DFS Used : 439.37 GB
DFS Remaining : 26.07 GB
DFS Used% : 0 %
DFS Remaining% : 5.6 %
Live Nodes : 1
Dead Nodes : 0


First I configured a single-node cluster and worked on it. After that I added another machine, made the new one a master + worker, and made the first machine a worker only.

I saved the dfs.name.dir separately, and started with a fresh cluster...

Now I have switched back to the previous stage: a single-node cluster on the same old machine.
I have given dfs.name.dir the path where I kept the saved copy.
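
For reference, you can confirm which directory the NN will load the image from; a sketch assuming a standard conf layout (adjust the path to your install):

grep -A 1 'dfs.name.dir' conf/hdfs-site.xml
# On startup the NN reads fsimage and edits from <that directory>/current.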

Now I am running it and getting the following.

I did -ls / operation and got this exception


mediaadmins-iMac-2:haadoop-0.20.2 mediaadmin$ HADOOP dfs -ls /user/hive/warehouse/vw_cc/
Found 1 items

-rw-r--r--   1 mediaadmin supergroup       1774 2012-10-17 16:15 /user/hive/warehouse/vw_cc/000000_0


mediaadmins-iMac-2:haadoop-0.20.2 mediaadmin$ HADOOP dfs -cat /user/hive/warehouse/vw_cc/000000_0


12/10/29 16:01:15 INFO hdfs.DFSClient: No node available for block: blk_-1280621588594166706_3595 file=/user/hive/warehouse/vw_cc/000000_0
12/10/29 16:01:15 INFO hdfs.DFSClient: Could not obtain block blk_-1280621588594166706_3595 from any node: java.io.IOException: No live nodes contain current block
12/10/29 16:01:18 INFO hdfs.DFSClient: No node available for block: blk_-1280621588594166706_3595 file=/user/hive/warehouse/vw_cc/000000_0
12/10/29 16:01:18 INFO hdfs.DFSClient: Could not obtain block blk_-1280621588594166706_3595 from any node: java.io.IOException: No live nodes contain current block
12/10/29 16:01:21 INFO hdfs.DFSClient: No node available for block: blk_-1280621588594166706_3595 file=/user/hive/warehouse/vw_cc/000000_0
12/10/29 16:01:21 INFO hdfs.DFSClient: Could not obtain block blk_-1280621588594166706_3595 from any node: java.io.IOException: No live nodes contain current block
12/10/29 16:01:24 WARN hdfs.DFSClient: DFS Read: java.io.IOException: Could not obtain block: blk_-1280621588594166706_3595 file=/user/hive/warehouse/vw_cc/000000_0
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.chooseDataNode(DFSClient.java:1812)
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.blockSeekTo(DFSClient.java:1638)
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.read(DFSClient.java:1767)
    at java.io.DataInputStream.read(DataInputStream.java:83)
    at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:47)
    at org.apache.hadoop.io.IOUtils.copyBytes(IOUtils.java:85)
    at org.apache.hadoop.fs.FsShell.printToStdout(FsShell.java:114)
    at org.apache.hadoop.fs.FsShell.access$100(FsShell.java:49)
    at org.apache.hadoop.fs.FsShell$1.process(FsShell.java:352)
    at org.apache.hadoop.fs.FsShell$DelayedExceptionThrowing.globAndProcess(FsShell.java:1898)
    at org.apache.hadoop.fs.FsShell.cat(FsShell.java:346)

I looked at the NN logs for one of the files; they show:

2012-10-29 15:26:02,560 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem.audit: ugi=null  ip=null  cmd=open  src=/user/hive/warehouse/vw_cc/000000_0  dst=null  perm=null
.
.
.
.

Please suggest

Regards
Yogesh Kumar




From: Uma Maheswara Rao G [maheswara@huawei.com]
Sent: Monday, October 29, 2012 3:52 PM
To: user@hadoop.apache.org
Subject: RE: How to do HADOOP RECOVERY ???

Which version of Hadoop are you using?

 

Do you have all DNs running? Can you check the UI report to see whether all DNs are alive?

Can you check whether the DN disks are healthy?

Can you grep the NN and DN logs for one of the corrupt block IDs from below?
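
Something along these lines (log file names are assumptions; adjust to your log directory and hostnames):

grep 'blk_-1280621588594166706' logs/hadoop-*-namenode-*.log
grep 'blk_-1280621588594166706' logs/hadoop-*-datanode-*.log
# Lines mentioning "invalidate" or "delete" would show when and why
# the replicas were removed.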

 

Regards,

Uma


From: yogesh.kumar13@wipro.com [yogesh.kumar13@wipro.com]
Sent: Monday, October 29, 2012 2:03 PM
To: user@hadoop.apache.org
Subject: How to do HADOOP RECOVERY ???

Hi All,

I run this command

hadoop fsck -Ddfs.http.address=localhost:50070 /

and found that some blocks are missing or corrupt.

The results look like this:

/user/hive/warehouse/tt_report_htcount/000000_0: MISSING 2 blocks of total size 71826120 B..
/user/hive/warehouse/tt_report_perhour_hit/000000_0: CORRUPT block blk_75438572351073797

/user/hive/warehouse/tt_report_perhour_hit/000000_0: MISSING 1 blocks of total size 1531 B..
/user/hive/warehouse/vw_cc/000000_0: CORRUPT block blk_-1280621588594166706

/user/hive/warehouse/vw_cc/000000_0: MISSING 1 blocks of total size 1774 B..
/user/hive/warehouse/vw_report2/000000_0: CORRUPT block blk_8637186139854977656

/user/hive/warehouse/vw_report2/000000_0: CORRUPT block blk_4019541597438638886

/user/hive/warehouse/vw_report2/000000_0: MISSING 2 blocks of total size 71826120 B..
/user/zoo/foo.har/_index: CORRUPT block blk_3404803591387558276
.
.
.
.
.

 Total size:    7600625746 B
 Total dirs:    205
 Total files:   173
 Total blocks (validated):      270 (avg. block size 28150465 B)
  ********************************
  CORRUPT FILES:        171
  MISSING BLOCKS:       269
  MISSING SIZE:         7600625742 B
  CORRUPT BLOCKS:       269
  ********************************
 Minimally replicated blocks:   1 (0.37037036 %)
 Over-replicated blocks:        0 (0.0 %)
 Under-replicated blocks:       0 (0.0 %)
 Mis-replicated blocks:         0 (0.0 %)
 Default replication factor:    1
 Average block replication:     0.0037037036
 Corrupt blocks:                269
 Missing replicas:              0 (0.0 %)
 Number of data-nodes:          1
 Number of racks:               1




Is there any way to recover them?

Please help and suggest

Thanks & Regards
yogesh kumar
