hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-971) DFS Scalabilty: Improve name node performance by adding a hostname to datanodes map
Date Tue, 03 Apr 2007 23:44:32 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12486532

Hadoop QA commented on HADOOP-971:

+1, because http://issues.apache.org/jira/secure/attachment/12354797/Host2NodesMap.patch applied
and successfully tested against trunk revision http://svn.apache.org/repos/asf/lucene/hadoop/trunk/525290.
Results are at http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch

> DFS Scalabilty: Improve name node performance by adding a hostname to datanodes map
> -----------------------------------------------------------------------------------
>                 Key: HADOOP-971
>                 URL: https://issues.apache.org/jira/browse/HADOOP-971
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: dfs
>    Affects Versions: 0.12.2
>            Reporter: Hairong Kuang
>         Assigned To: Hairong Kuang
>             Fix For: 0.13.0
>         Attachments: Host2NodesMap.patch
> A name node currently maintains a datanode map which maps a storage id to a dataNodeDescriptor.
When the name node needs to get a datanodeDescriptor by its name (hostname:port#) or by its
host name, it has to iterate through the data node set, which is very inefficent. I'd like
to add an additional map that maps a host name to an array list of DatanodeDescriptors that
contains all the data nodes running on the host. Since most of time a node runs only one data
node, the intial size of the array list is set to be 1.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message