hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tsz Wo (Nicholas), SZE (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HDFS-786) Implement getContentSummary(..) in HftpFileSystem
Date Thu, 14 Jan 2010 23:46:54 GMT

     [ https://issues.apache.org/jira/browse/HDFS-786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tsz Wo (Nicholas), SZE updated HDFS-786:
----------------------------------------

    Attachment: h786_20100106_0.20.patch

h786_20100106_0.20.patch: for 0.20 (won't be committed).

> Implement getContentSummary(..) in HftpFileSystem
> -------------------------------------------------
>
>                 Key: HDFS-786
>                 URL: https://issues.apache.org/jira/browse/HDFS-786
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Tsz Wo (Nicholas), SZE
>            Assignee: Tsz Wo (Nicholas), SZE
>             Fix For: 0.22.0
>
>         Attachments: h786_20091223.patch, h786_20091224.patch, h786_20100104.patch, h786_20100106.patch,
h786_20100106_0.20.patch
>
>
> HftpFileSystem does not override getContentSummary(..).  As a result, it uses FileSystem's
default implementation, which computes content summary on the client side by calling listStatus(..)
recursively.  In contrast, DistributedFileSystem has overridden getContentSummary(..) and
does the computation on the NameNode.
> As a result, running "fs -dus" on hftp is much slower than running it on hdfs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message