Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 42485 invoked from network); 19 Oct 2010 17:30:47 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 19 Oct 2010 17:30:47 -0000 Received: (qmail 63013 invoked by uid 500); 19 Oct 2010 17:30:47 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 62973 invoked by uid 500); 19 Oct 2010 17:30:47 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 62965 invoked by uid 99); 19 Oct 2010 17:30:47 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 19 Oct 2010 17:30:47 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 19 Oct 2010 17:30:47 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o9JHURt6004418 for ; Tue, 19 Oct 2010 17:30:27 GMT Message-ID: <18894240.50521287509427179.JavaMail.jira@thor> Date: Tue, 19 Oct 2010 13:30:27 -0400 (EDT) From: "Ramkumar Vadali (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Updated: (MAPREDUCE-2143) HarFileSystem is not able to handle spaces in its path In-Reply-To: <19294455.50471287509307490.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ramkumar Vadali updated MAPREDUCE-2143: --------------------------------------- Attachment: MAPREDUCE-2143.patch This patch has a unit-test that shows the problem. It also contains a fix that seems to work with existing unit-tests also. I have also tested this in production > HarFileSystem is not able to handle spaces in its path > ------------------------------------------------------ > > Key: MAPREDUCE-2143 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-2143 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: harchive > Reporter: Ramkumar Vadali > Attachments: MAPREDUCE-2143.patch > > > If the Path to the HAR contains spaces, Path.getFileSystem() fails. The problem is in HarFileSystem.initialize(), which uses URI.toString() to get a string for getting to the .har suffix. URI.toString() returns a percent-encoded string when the path contains spaces. When this string is subsequently used to get the _index file, we get a FileNotFoundException. The fix is to use URI.getPath(). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.