Date: Mon, 8 Aug 2011 04:33:27 +0000 (UTC)
From: "Eli Collins (JIRA)"
To: hdfs-issues@hadoop.apache.org
Reply-To: hdfs-issues@hadoop.apache.org
Message-ID: <1165802212.15565.1312778007081.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1967411323.15507.1312776927116.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (HDFS-2235) Hftp doesn't support paths with semicolons

    [
https://issues.apache.org/jira/browse/HDFS-2235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080738#comment-13080738 ]

Eli Collins commented on HDFS-2235:
-----------------------------------

The bug is that HttpServletRequest#getPathInfo in jetty (and other popular servlet implementations, see [1]) doesn't treat a semicolon, or the characters that follow it up to the query string, as part of the path for some reason. Eg for "foo;bar?a=b" getPathInfo returns "foo" rather than "foo;bar".

One workaround is to use getRequestURI, which will return "foo;bar". Unlike getPathInfo, however, the result is not decoded, so the parts of the path that needed encoding (eg those containing spaces) and were encoded when the URI object was created will not get decoded. URL decoding the value getRequestURI returns doesn't work either, because it also decodes reserved URI characters (eg it will decode "+" into a space). The decoding needs to be the same path decoding that eg jetty would perform [2], ie one that doesn't decode reserved characters.

A suggestion from Todd is to encode the path used when creating the URL: URL encode each individual segment of the path when creating the "/data" URL, and then decode each segment when retrieving the path with getPathInfo.

1. http://cdivilly.wordpress.com/2011/04
2. http://jetty.codehaus.org/jetty/jetty-6/xref/org/mortbay/util/URIUtil.html

> Hftp doesn't support paths with semicolons
> ------------------------------------------
>
>                 Key: HDFS-2235
>                 URL: https://issues.apache.org/jira/browse/HDFS-2235
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: name-node
>    Affects Versions: 0.23.0
>            Reporter: Eli Collins
>             Fix For: 0.23.0
>
>
> Hftp does not support paths which contain semicolons. The commented out test in HDFS-2234 illustrates this.
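
A rough sketch of the per-segment encoding suggested in the comment above, for illustration only. The class name SegmentCodec and its two methods are hypothetical and are not taken from any HDFS patch. It assumes java.net.URLEncoder/URLDecoder are an acceptable codec for individual segments, and it does not model how a servlet container's own path decoding would interact with the encoded form.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical helper: encodes and decodes a path one segment at a time so
// that characters such as ';' never appear literally in the request path.
public class SegmentCodec {

    // Encode each '/'-separated segment; the separators are left untouched.
    public static String encodePath(String path) {
        String[] segments = path.split("/", -1); // -1 keeps empty segments
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < segments.length; i++) {
            if (i > 0) {
                sb.append('/');
            }
            try {
                sb.append(URLEncoder.encode(segments[i], "UTF-8"));
            } catch (UnsupportedEncodingException e) {
                throw new IllegalStateException("UTF-8 is always supported", e);
            }
        }
        return sb.toString();
    }

    // Reverse of encodePath: decode each segment independently.
    public static String decodePath(String encoded) {
        String[] segments = encoded.split("/", -1);
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < segments.length; i++) {
            if (i > 0) {
                sb.append('/');
            }
            try {
                sb.append(URLDecoder.decode(segments[i], "UTF-8"));
            } catch (UnsupportedEncodingException e) {
                throw new IllegalStateException("UTF-8 is always supported", e);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String original = "/user/eli/foo;bar baz+qux";
        String encoded = encodePath(original);
        System.out.println(encoded);                               // /user/eli/foo%3Bbar+baz%2Bqux
        System.out.println(decodePath(encoded).equals(original));  // true
    }
}
```

Because the semicolon is sent as "%3B", there is no literal ";" left in the path. Whether getPathInfo then returns the segment still encoded or already decoded depends on the servlet implementation, so the decode step would need to be checked against the container in use.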