hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Mackrory (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-10797) Disk usage summary of snapshots causes renamed blocks to get counted twice
Date Thu, 25 Aug 2016 16:46:20 GMT
Sean Mackrory created HDFS-10797:
------------------------------------

             Summary: Disk usage summary of snapshots causes renamed blocks to get counted
twice
                 Key: HDFS-10797
                 URL: https://issues.apache.org/jira/browse/HDFS-10797
             Project: Hadoop HDFS
          Issue Type: Bug
            Reporter: Sean Mackrory


DirectoryWithSnapshotFeature.computeContentSummary4Snapshot calculates how much disk usage
is used by a snapshot by tallying up the files in the snapshot that have since been deleted
(that way it won't overlap with regular files whose disk usage is computed separately). However
that is determined from a diff that shows moved (to Trash or otherwise) or renamed files as
a deletion and a creation operation that may overlap with the list of blocks. Only the deletion
operation is taken into consideration, and this causes those blocks to get represented twice
in the disk usage tallying.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message