hadoop-common-issues mailing list archives

From "Konstantin Shvachko (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-6857) FsShell should report raw disk usage including replication factor
Date Sat, 18 Oct 2014 00:18:34 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14175741#comment-14175741 ]

Konstantin Shvachko commented on HADOOP-6857:

*Scenario 1* looks rather unusual, but again this is how quotas are counted: when you modify
a file in a snapshot, it is counted twice for quotas, even if it doesn't actually store
extra blocks. If people want to change this, it should be a different jira related to quotas.
I don't think anything needs to be fixed in this regard here.
*Scenario 2* indeed gives inconsistent results for _du_ and _dus_, because _du_ resolves
the path to point to the current tree rather than to the snapshot contents. This doesn't look
right to me.

> FsShell should report raw disk usage including replication factor
> -----------------------------------------------------------------
>                 Key: HADOOP-6857
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6857
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Alex Kozlov
>            Assignee: Byron Wong
>         Attachments: HADOOP-6857.patch, show-space-consumed.txt
> Currently FsShell reports HDFS usage with the "hadoop fs -dus <path>" command.  Since
> replication level is set per file, it would be nice to add raw disk usage including the
> replication factor (maybe "hadoop fs -dus -raw <path>"?).  This would allow assessing
> resource usage more accurately.  -- Alex K
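
For illustration, the difference between the logical usage reported today and the raw usage requested in the issue can be sketched as below. This is only a minimal sketch of the arithmetic, assuming raw usage is each file's length multiplied by its own replication factor; `FileEntry`, `logical_usage`, and `raw_usage` are hypothetical names and not part of the Hadoop API or of the attached patch.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class FileEntry:
    """Hypothetical stand-in for a file's status: path, length in bytes, replication."""
    path: str
    length: int
    replication: int


def logical_usage(files: Iterable[FileEntry]) -> int:
    """Sum of file lengths, ignoring replication."""
    return sum(f.length for f in files)


def raw_usage(files: Iterable[FileEntry]) -> int:
    """Sum of length * replication, computed per file because replication is per file."""
    return sum(f.length * f.replication for f in files)


if __name__ == "__main__":
    # Made-up example files with different replication factors.
    files = [
        FileEntry("/data/a", 100, 3),
        FileEntry("/data/b", 50, 2),
    ]
    print("logical bytes:", logical_usage(files))
    print("raw bytes:", raw_usage(files))
```

Because replication is set per file, the raw figure cannot be derived by multiplying the logical total by a single cluster-wide factor; it has to be accumulated file by file as above.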

This message was sent by Atlassian JIRA
