hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daryn Sharp (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-10616) Improve performance of path handling
Date Tue, 12 Jul 2016 18:09:20 GMT
Daryn Sharp created HDFS-10616:

             Summary: Improve performance of path handling
                 Key: HDFS-10616
                 URL: https://issues.apache.org/jira/browse/HDFS-10616
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: hdfs
    Affects Versions: 2.0.0-alpha
            Reporter: Daryn Sharp
            Assignee: Daryn Sharp

Path handling in the namesystem and directory is very inefficient.  The path is repeatedly
resolved, decomposed into path components, recombined to a full path. parsed again, throughout
the system.  This is directly inefficient for general performance, and indirectly via unnecessary
pressure on young gen GC.

The namesystem should only operate on paths, parse it once into inodes, and the directory
should only operate on inodes.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

View raw message