hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daryn Sharp (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-6732) Improve FsShell's heap consumption by switching to listStatus that returns an iterator
Date Tue, 16 Aug 2011 15:19:28 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-6732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Daryn Sharp updated HADOOP-6732:
--------------------------------

    Fix Version/s: 0.23.0

> Improve FsShell's heap consumption by switching to listStatus that returns an iterator
> --------------------------------------------------------------------------------------
>
>                 Key: HADOOP-6732
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6732
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Hairong Kuang
>            Assignee: Daryn Sharp
>             Fix For: 0.23.0
>
>
> When listing a large directory from the command line using the default heap configuration,
FsShell often runs out of memory. This is because all stats of the entries under the directory
need to be in memory before printing them. The new API listStatus that returns an iterator
of FileStatus, which implemented in HDFS-1091, no longer requires that all entries are fetched
first. Thus switching to this new API will greatly improve the use of heap space.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message