hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "dhruba borthakur (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3293) When an input split spans cross block boundary, the split location should be the host having most of bytes on it.
Date Fri, 31 Oct 2008 23:07:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12644460#action_12644460
] 

dhruba borthakur commented on HADOOP-3293:
------------------------------------------

> DFS would override this method and will return the rack information along with the hosts.

This is a good idea, but returning only rack location might not work in the general case when
there are more than 2 levels in the network topology. Knowing the name of a rack might not
tell you how close it is to another rack. But getFileBlockLocations could return the complete
path of the host in the network topology. I will provide a patch for this one. See HADOOP-4567

> When an input split spans cross block boundary, the split location should be the host
having most of bytes on it. 
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3293
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3293
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>            Assignee: Jothi Padmanabhan
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message