hadoop-hdfs-issues mailing list archives

From "Gautam Gopalakrishnan (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-8986) Add option to -du to calculate directory space usage excluding snapshots
Date Fri, 28 Aug 2015 03:42:45 GMT
Gautam Gopalakrishnan created HDFS-8986:
-------------------------------------------

             Summary: Add option to -du to calculate directory space usage excluding snapshots
                 Key: HDFS-8986
                 URL: https://issues.apache.org/jira/browse/HDFS-8986
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: snapshots
            Reporter: Gautam Gopalakrishnan


When running {{hadoop fs -du}} on a snapshotted directory (or one of its children), the report
includes space consumed by blocks that are only present in the snapshots. This is confusing
for end users.

{noformat}
$ hadoop fs -du -h -s /tmp/parent /tmp/parent/*
799.7 M  2.3 G  /tmp/parent
799.7 M  2.3 G  /tmp/parent/sub1

$ hdfs dfs -createSnapshot /tmp/parent snap1
Created snapshot /tmp/parent/.snapshot/snap1

$ hadoop fs -rm -skipTrash /tmp/parent/sub1/*
...

$ hadoop fs -du -h -s /tmp/parent /tmp/parent/*
799.7 M  2.3 G  /tmp/parent
799.7 M  2.3 G  /tmp/parent/sub1

$ hdfs dfs -deleteSnapshot /tmp/parent snap1
$ hadoop fs -du -h -s /tmp/parent /tmp/parent/*
0  0  /tmp/parent
0  0  /tmp/parent/sub1
{noformat}

It would be helpful to have a flag, say -X, that excludes any snapshot-related disk usage
from the output.
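
As a rough illustration of the requested behaviour, here is a toy model in Python. It is not HDFS code: the {{File}} and {{Directory}} classes, the {{live}} field and the {{exclude_snapshots}} parameter are all hypothetical names made up for this sketch. It assumes a replication factor of 3, which is consistent with the 799.7 M / 2.3 G figures above, and uses small byte counts instead of the real sizes.

```python
from dataclasses import dataclass, field


@dataclass
class File:
    name: str
    length: int            # logical size in bytes
    replication: int = 3   # assumed replication factor
    live: bool = True      # False: removed from the live tree, kept only by a snapshot


@dataclass
class Directory:
    name: str
    files: list = field(default_factory=list)
    subdirs: list = field(default_factory=list)


def du(directory, exclude_snapshots=False):
    """Return (length, space_consumed) for a directory, like the two du columns.

    With exclude_snapshots=True, files that survive only in a snapshot are
    skipped, which is what the proposed flag would do in this toy model.
    """
    length = 0
    consumed = 0
    for f in directory.files:
        if exclude_snapshots and not f.live:
            continue
        length += f.length
        consumed += f.length * f.replication
    for sub in directory.subdirs:
        sub_length, sub_consumed = du(sub, exclude_snapshots)
        length += sub_length
        consumed += sub_consumed
    return length, consumed


# Mirror the scenario above: the files under sub1 were deleted after the
# snapshot was taken, so they are referenced only by the snapshot.
sub1 = Directory("sub1", files=[File("a", 500, live=False),
                                File("b", 300, live=False)])
parent = Directory("parent", subdirs=[sub1])

print(du(parent))                          # (800, 2400): current behaviour
print(du(parent, exclude_snapshots=True))  # (0, 0): with the proposed flag
```

In this model the default call reproduces the confusing report (usage unchanged after the delete), while the excluding call matches what an end user would expect to see for the live directory tree.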




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
