hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hairong Kuang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-4717) Removal of default port# in NameNode.getUri() cause a map/reduce job failed to prompt temporay output
Date Thu, 04 Dec 2008 01:29:44 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Hairong Kuang updated HADOOP-4717:
----------------------------------

    Attachment: relativePath.patch

In addition to Doug's change, this patch
1. throws IOException if relativitize fails as Koji suggested;
2. add a unit test to make sure a map/reduce job with output path containing no port works.

> Removal of default port# in NameNode.getUri() cause a map/reduce job failed to prompt
temporay output
> -----------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-4717
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4717
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.18.0
>            Reporter: Hairong Kuang
>            Assignee: Doug Cutting
>            Priority: Blocker
>             Fix For: 0.18.3
>
>         Attachments: HADOOP-4717.patch, relativePath.patch
>
>
> Problem reported here is that when the default port number (8020) is specified in the
output, job succeeds but no output is created. The cause of the problem is that "listStatus"
call drops the port number because NameNode.getUri removes the default port#.
> Assuming that a map/reduce output directory is set to be "hdfs://localhost:8020/out",
A call "listStatus" on any of its sub directory, for example, "hdfs://localhost:8020/out/tempXX",
returns results like below: 
> hdfs://localhost/out/tempXX/part-00005
> Because of this, Task.java
>     574   private Path getFinalPath(Path jobOutputDir, Path taskOutput) {
>     575     URI relativePath = taskOutputPath.toUri().relativize(taskOutput.toUri());
> does not get the correct relativePath because TaskOutputPath contain ports, but taskOutput
doesn't.
> It seems to me that the problem could be fixed if we make Path.makeQualified() to return
the same path not matter the input path contains the default port or not. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message