hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jameson Li <hovlj...@gmail.com>
Subject hadoop 0.20 hdfs archives
Date Mon, 22 Oct 2012 04:07:21 GMT

Our hadoop version is hadoop-0.20-append.
When using hadoop archives, I can see that the file "_index" and
"_masterindex" is set to "repl=10" and the har part file is set to
“repl=3”(see below the fsck show).
So I have a doubt that as the har part file only has repl 3, when the block
is missing, so the har part will be broken?
If I want add the har file's high availability, after the archive mapreduce
I have to manually set the full file's repl to a high number as 10?

The fsck info:
hadoop fsck /hadoop/archive -files -blocks -racks
/hadoop/archive/t.har <dir>
/hadoop/archive/t.har/_index 34433 bytes, 1 block(s):  OK
0. blk_8341000632547263306_70582655 len=34433 repl=10
[/default/G06-202/x.y.z.126:50010, /default/G06-202/x.y.z.127:50010,
/default/B04-203/x.y.z.99:50010, /default/D08-203/x.y.z.100:50010,
/default/F04-203/x.y.z.144:50010, /default/G11-202/x.y.z.86:50010,
/default/G07-202/x.y.z.129:50010, /default/B04-203/x.y.z.91:50010,
/default/B05-203/x.y.z.61:50010, /default/B05-203/x.y.z.63:50010]

/hadoop/archive/t.har/_masterindex 25 bytes, 1 block(s):  OK
0. blk_5449096664018130780_70582655 len=25 repl=10
[/default/G06-202/x.y.z.125:50010, /default/G06-202/x.y.z.119:50010,
/default/B04-203/x.y.z.99:50010, /default/A04-203/x.y.z.56:50010,
/default/B06-203/x.y.z.69:50010, /default/G9-202/x.y.z.74:50010,
/default/G10-202/x.y.z.78:50010, /default/G10-202/x.y.z.77:50010,
/default/B04-203/x.y.z.91:50010, /default/B05-203/x.y.z.62:50010]

/hadoop/archive/t.har/part-0 133722259 bytes, 1 block(s):  OK
0. blk_-9118198399046694176_70582469 len=133722259 repl=3
[/default/B04-203/x.y.z.93:50010, /default/B04-203/x.y.z.91:50010,

 Total size: 133756717 B
 Total dirs: 1
 Total files: 3
 Total blocks (validated): 3 (avg. block size 44585572 B)
 Minimally replicated blocks: 3 (100.0 %)
 Over-replicated blocks: 0 (0.0 %)
 Under-replicated blocks: 0 (0.0 %)
 Mis-replicated blocks: 0 (0.0 %)
 Default replication factor: 3
 Average block replication: 7.6666665
 Corrupt blocks: 0
 Missing replicas: 0 (0.0 %)
 Number of data-nodes: xx
 Number of racks: yy


View raw message