Hi Thanh,
The scan period is the time within which Hadoop *attempts* to complete a full scan of the
node. That is, if it's set to 3 weeks, HDFS will try to scan each block once every 3 weeks.
It is a target rather than a guarantee: depending on the bandwidth you have made available
to the scanning thread, you can specify a period too short to ever be met.
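
To make the bandwidth point concrete, here is a rough back-of-the-envelope sketch. It is
not code from Hadoop: the class name, the helper methods, the 4 TiB of block data and the
8 MiB/s scan budget are all assumptions chosen for illustration. Only the three-week
constant comes from the source you quoted.

```java
// Hypothetical helper, not part of Hadoop: scan-rate arithmetic only.
final class ScanPeriodSketch {
    // Same value as the constant quoted from DataBlockScanner: 21 days in hours.
    static final long DEFAULT_SCAN_PERIOD_HOURS = 21 * 24L; // three weeks

    /** Average read rate (bytes/second) needed to scan totalBytes once per period. */
    static double requiredBytesPerSecond(long totalBytes, long periodHours) {
        if (periodHours <= 0) {
            throw new IllegalArgumentException("periodHours must be positive");
        }
        return (double) totalBytes / (periodHours * 3600.0);
    }

    /** True if the period can be met with the given scan bandwidth budget. */
    static boolean isFeasible(long totalBytes, long periodHours, double budgetBytesPerSecond) {
        return requiredBytesPerSecond(totalBytes, periodHours) <= budgetBytesPerSecond;
    }

    public static void main(String[] args) {
        long totalBytes = 4L * 1024 * 1024 * 1024 * 1024; // assumed: 4 TiB of blocks on the node
        double budget = 8.0 * 1024 * 1024;                // assumed: 8 MiB/s for the scanner

        for (long hours : new long[] {DEFAULT_SCAN_PERIOD_HOURS, 24L}) {
            double mib = requiredBytesPerSecond(totalBytes, hours) / (1024 * 1024);
            System.out.printf("period=%d h needs about %.2f MiB/s, feasible=%b%n",
                    hours, mib, isFeasible(totalBytes, hours, budget));
        }
    }
}
```

With these assumed numbers the three-week period fits comfortably inside the budget, while
a one-day period asks for more than the scanner is allowed to read, so the scan would
simply fall behind.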
Brian
On Oct 13, 2010, at 7:01 PM, Thanh Do wrote:
> Hi again,
>
> Could anybody explain to me the scanning period
> policy of DataBlockScanner? That is, how often does it wake up
> and scan a block file?
> When looking at the code, I found
>
> static final long DEFAULT_SCAN_PERIOD_HOURS = 21*24L; // three weeks
>
>
> but definitely it does not wake up and pick a random block
> to verify every three weeks, right?
>
> Thanks a lot,
> Thanh