hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Newman (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-9884) Hadoop calling du -sk is expensive
Date Mon, 19 Aug 2013 19:13:48 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-9884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Alex Newman updated HADOOP-9884:

    Summary: Hadoop calling du -sk is expensive  (was: Hadoop calling du -sk can cause huge
load )
> Hadoop calling du -sk is expensive
> ----------------------------------
>                 Key: HADOOP-9884
>                 URL: https://issues.apache.org/jira/browse/HADOOP-9884
>             Project: Hadoop Common
>          Issue Type: Bug
>            Reporter: Alex Newman
> On numerous occasions we've had customers worry about slowness while hadoop calls du
-sk underneath the hood. For most of these users getting the information from df would be
sufficient and much faster. In fact there is a hack going around, that is quiet common that
replaces df with du. Sometimes people have to tune the vcache. What if we just allowed users
to use the df information instead of the du information with a patch and config setting. I'd
be glad to code it up

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message