hadoop-common-dev mailing list archives

From "Raghu Angadi (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-1459) FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce
Date Sat, 09 Jun 2007 18:29:26 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12503107 ]

Raghu Angadi commented on HADOOP-1459:
--------------------------------------

bq. Note that this increases serialization cost for any DatanodeInfo transfer, which is pretty
much most RPC. This will need a protocol version change since this won't work with previous clients/datanodes.

Does this also affect fsimage version?
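
To make the serialization-cost point in the quoted note concrete, here is a minimal sketch. It is not Hadoop's actual DatanodeInfo wire format: the class `SerializationCostSketch`, the method `serializedSize`, and the sample address and hostname are hypothetical, and `java.io.DataOutputStream.writeUTF` stands in for whatever string encoding the real protocol uses.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

public class SerializationCostSketch {
    // Serializes a hypothetical node record and returns its size in bytes.
    // Passing hostName == null mimics the older layout without a hostname field.
    public static int serializedSize(String ipAndPort, String hostName) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(ipAndPort);
            if (hostName != null) {
                // The extra field that an older reader would not know to consume.
                out.writeUTF(hostName);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.size();
    }

    public static void main(String[] args) {
        int oldSize = serializedSize("10.0.0.2:50010", null);
        int newSize = serializedSize("10.0.0.2:50010", "node2.example.com");
        System.out.println("without hostname: " + oldSize + " bytes");
        System.out.println("with hostname:    " + newSize + " bytes");
    }
}
```

In this sketch every record grows by the encoded hostname, and a reader written for the older layout would leave those extra bytes unread and misinterpret whatever follows, which is the kind of incompatibility a protocol version change is meant to flag.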

> FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce
> -------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-1459
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1459
>             Project: Hadoop
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.14.0
>            Reporter: Arun C Murthy
>            Assignee: dhruba borthakur
>            Priority: Blocker
>             Fix For: 0.14.0
>
>         Attachments: getHintsIpAddress.patch
>
>
> FileSystem.getFileCacheHints via DFSClient.getHints (post HADOOP-894?) returns the IP addresses
> of the datanodes instead of their hostnames, which breaks the mapping from task-trackers to datanodes
> in map-reduce, i.e. the system cannot intelligently place maps on the datanodes where the blocks are
> present.
> I have verified that this affects trunk only, branch-0.13.0 seems ok.
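
The failure mode described in the issue can be sketched as follows. This is an illustration only, not Hadoop's scheduler code: the class `LocalitySketch`, the method `findLocalTracker`, and the sample hostnames and address are hypothetical, and the sketch assumes task-trackers are identified by hostname and matched against the hint strings by exact comparison.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

public class LocalitySketch {
    // Returns the first tracker whose name exactly matches one of the block's hint strings.
    public static Optional<String> findLocalTracker(List<String> trackerHostnames,
                                                    List<String> blockHints) {
        for (String tracker : trackerHostnames) {
            if (blockHints.contains(tracker)) {
                return Optional.of(tracker);
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<String> trackers = Arrays.asList("node1.example.com", "node2.example.com");

        // Hints expressed as hostnames line up with the tracker names.
        List<String> hostnameHints = Arrays.asList("node2.example.com");
        System.out.println(findLocalTracker(trackers, hostnameHints).isPresent());

        // Hints expressed as IP addresses never match, so no local tracker is found.
        List<String> ipHints = Arrays.asList("10.0.0.2");
        System.out.println(findLocalTracker(trackers, ipHints).isPresent());
    }
}
```

Under these assumptions the block is stored on a node that runs a task-tracker in both cases, but only the hostname form of the hint lets the lookup find it.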

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

