hadoop-hdfs-dev mailing list archives

From "Xinwei Qin (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-8710) Always read DU value from the cached "dfsUsed" file on datanode startup
Date Thu, 02 Jul 2015 09:07:05 GMT
Xinwei Qin  created HDFS-8710:
---------------------------------

             Summary: Always read DU value from the cached "dfsUsed" file on datanode startup
                 Key: HDFS-8710
                 URL: https://issues.apache.org/jira/browse/HDFS-8710
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Xinwei Qin 
            Assignee: Xinwei Qin 


Currently, the DataNode periodically caches the DU value in the "dfsUsed" file. When the DataNode
starts or restarts, it reads the cached DU value from the "dfsUsed" file if that value is less than
600 seconds old; otherwise it runs the DU command, which is very time-consuming (it can take
dozens of minutes) when the DataNode holds a huge number of blocks.

Since slight imprecision in dfsUsed is not critical, and the DU value is refreshed every
600 seconds (the default DU interval) once the DataNode has started, we can always read the DU
value from the cached file (regardless of whether it is less than 600 seconds old) and skip the
DU operation on DataNode startup, which significantly shortens startup time.
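
To make the difference between the current and proposed behavior concrete, here is a minimal
sketch in Java. It is not the actual DataNode code: the class name DfsUsedCache, the methods
save and load, the alwaysTrustCache flag, and the assumed file layout ("<usedBytes> <savedAtMillis>"
on one line) are all hypothetical and exist only for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of Hadoop: illustrates the startup decision only.
final class DfsUsedCache {
    /** Sentinel meaning "no usable cached value; the caller must run a full DU scan". */
    static final long NO_CACHED_VALUE = -1L;

    /** Writes the cache file. Assumed layout: "<usedBytes> <savedAtMillis>". */
    static void save(Path file, long usedBytes, long nowMillis) throws IOException {
        Files.write(file, (usedBytes + " " + nowMillis).getBytes(StandardCharsets.UTF_8));
    }

    /**
     * Returns the cached value, or NO_CACHED_VALUE if it cannot be used.
     * alwaysTrustCache = false models the current behavior (age check applies);
     * alwaysTrustCache = true models the proposal (age check is skipped).
     */
    static long load(Path file, long nowMillis, long maxAgeMillis, boolean alwaysTrustCache) {
        try {
            String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8).trim();
            String[] parts = text.split("\\s+");
            if (parts.length != 2) {
                return NO_CACHED_VALUE; // empty or malformed file
            }
            long usedBytes = Long.parseLong(parts[0]);
            long savedAtMillis = Long.parseLong(parts[1]);
            if (usedBytes < 0 || savedAtMillis <= 0) {
                return NO_CACHED_VALUE; // implausible contents
            }
            boolean fresh = nowMillis - savedAtMillis < maxAgeMillis;
            return (alwaysTrustCache || fresh) ? usedBytes : NO_CACHED_VALUE;
        } catch (IOException | NumberFormatException e) {
            return NO_CACHED_VALUE; // missing or unreadable file: fall back to DU
        }
    }
}
```

With maxAgeMillis set to 600000 (600 seconds), a cache file written 700 seconds ago is rejected
when alwaysTrustCache is false and accepted when it is true. In both modes a missing or malformed
file still yields NO_CACHED_VALUE, so the DU scan remains the fallback.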




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
