hadoop-hdfs-issues mailing list archives

From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-2316) webhdfs: a complete FileSystem implementation for accessing HDFS over HTTP
Date Fri, 09 Sep 2011 02:40:09 GMT

    [ https://issues.apache.org/jira/browse/HDFS-2316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100891#comment-13100891 ]

Alejandro Abdelnur commented on HDFS-2316:

IMO, the nice thing about #2 is that the file paths of *HDFS:* and *HTTP:* URIs will be exactly
the same, and in the case of the NN/DD deployment of HOOP even the host will be the same.

In addition, it is intuitive and has no caveats: a given path will just work by replacing the
SCHEME://HOST:PORT part of it.
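
To illustrate the point about swapping only the SCHEME://HOST:PORT part, here is a minimal
sketch using plain java.net.URI. The class name, the helper withAuthority, the host name and
the port numbers are all hypothetical choices made for this example; nothing here is taken
from the webhdfs or HOOP code.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class SchemeSwap {

    // Hypothetical helper: rebuild a URI with a new scheme, host and port,
    // leaving path, query and fragment untouched.
    static URI withAuthority(URI original, String scheme, String host, int port) {
        try {
            return new URI(scheme, null, host, port,
                    original.getPath(), original.getQuery(), original.getFragment());
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        // Host and ports are illustrative assumptions only.
        URI hdfs = URI.create("hdfs://nn.example.com:8020/user/alice/part-00000");
        URI http = withAuthority(hdfs, "http", "nn.example.com", 50070);

        System.out.println(hdfs);
        System.out.println(http);
        // The path component is identical in both URIs.
        System.out.println(hdfs.getPath().equals(http.getPath()));
    }
}
```

If the HTTP URIs carried an extra path prefix, a helper like this would also have to add or
strip that prefix on every conversion.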

Finally, and IMO this is very important from the usability perspective, user applications
that are designed to take the URI of the FS as a parameter and operate via HDFS: or HTTP:
will otherwise be difficult to code. Hadoop's *Path(String parent, String child)* uses
*URI.resolve(...)*, which applies well-defined logic to resolve URIs against other URIs
[ http://download.oracle.com/javase/6/docs/api/java/net/URI.html#resolve(java.net.URI) ].
If we use a prefix for HTTP URIs, then it will become difficult and error-prone to compose
HDFS: URIs from HTTP: URIs and vice versa. (And I believe the same is true for libraries in
other languages.)
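
As a rough illustration of the URI.resolve(...) concern, the sketch below resolves the same
absolute child path against three base URIs using only java.net.URI. The class name, the
host, the ports and the "/prefix" path segment are hypothetical and stand in for whatever
prefix an HTTP scheme might use; this is not Hadoop's Path implementation.

```java
import java.net.URI;

public class ResolveDemo {

    // Resolve a child reference against a base URI and return the result.
    static URI resolve(String base, String child) {
        return URI.create(base).resolve(child);
    }

    public static void main(String[] args) {
        String child = "/user/bob/report.txt"; // absolute path reference

        // Same path layout under both schemes: resolution behaves identically.
        System.out.println(resolve("hdfs://nn.example.com:8020/user/alice/", child));
        System.out.println(resolve("http://nn.example.com:50070/user/alice/", child));

        // With a hypothetical "/prefix" on the HTTP side, resolving an absolute
        // child replaces the whole path, so the prefix is silently dropped.
        System.out.println(resolve("http://nn.example.com:50070/prefix/user/alice/", child));
    }
}
```

The third call yields a URI whose path is /user/bob/report.txt with no prefix, so an
application working with prefixed HTTP URIs would need extra logic around every resolve.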

Also, I have not seen HDFS files under */data* as a common practice, thus the name collision
won't be that common.

> webhdfs: a complete FileSystem implementation for accessing HDFS over HTTP
> --------------------------------------------------------------------------
>                 Key: HDFS-2316
>                 URL: https://issues.apache.org/jira/browse/HDFS-2316
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: Tsz Wo (Nicholas), SZE
>            Assignee: Tsz Wo (Nicholas), SZE
> We currently have hftp for accessing HDFS over HTTP.  However, hftp is a read-only FileSystem
> and does not provide "write" accesses.
> In HDFS-2284, we propose to have webhdfs for providing a complete FileSystem implementation
> for accessing HDFS over HTTP.  This is the umbrella JIRA for the tasks.
