Return-Path: Delivered-To: apmail-hadoop-common-issues-archive@minotaur.apache.org Received: (qmail 11147 invoked from network); 15 Sep 2009 14:52:53 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 15 Sep 2009 14:52:53 -0000 Received: (qmail 56461 invoked by uid 500); 15 Sep 2009 14:52:52 -0000 Delivered-To: apmail-hadoop-common-issues-archive@hadoop.apache.org Received: (qmail 56376 invoked by uid 500); 15 Sep 2009 14:52:52 -0000 Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-issues@hadoop.apache.org Delivered-To: mailing list common-issues@hadoop.apache.org Received: (qmail 56213 invoked by uid 99); 15 Sep 2009 14:52:52 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Sep 2009 14:52:52 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Sep 2009 14:52:48 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id EE918234C1EF for ; Tue, 15 Sep 2009 07:51:57 -0700 (PDT) Message-ID: <1223518559.1253026317975.JavaMail.jira@brutus> Date: Tue, 15 Sep 2009 07:51:57 -0700 (PDT) From: "Ben Slusky (JIRA)" To: common-issues@hadoop.apache.org Subject: [jira] Commented: (HADOOP-6097) Multiple bugs w/ Hadoop archives MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-6097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12755514#action_12755514 ] Ben Slusky commented on HADOOP-6097: ------------------------------------ Mahadev, HADOOP-6231 *is* a bug fix -- see Koji's last comment above. I attached a patch for the 0.20 branch there. > Multiple bugs w/ Hadoop archives > -------------------------------- > > Key: HADOOP-6097 > URL: https://issues.apache.org/jira/browse/HADOOP-6097 > Project: Hadoop Common > Issue Type: Bug > Components: fs > Affects Versions: 0.18.0, 0.18.1, 0.18.2, 0.18.3, 0.19.0, 0.19.1, 0.20.0 > Reporter: Ben Slusky > Assignee: Ben Slusky > Fix For: 0.20.2 > > Attachments: HADOOP-6097-v2.patch, HADOOP-6097.patch > > > Found and fixed several bugs involving Hadoop archives: > - In makeQualified(), the sloppy conversion from Path to URI and back mangles the path if it contains an escape-worthy character. > - It's possible that fileStatusInIndex() may have to read more than one segment of the index. The LineReader and count of bytes read need to be reset for each block. > - har:// connections cannot be indexed by (scheme, authority, username) -- the path is significant as well. Caching them in this way limits a hadoop client to opening one archive per filesystem. It seems to be safe not to cache them, since they wrap another connection that does the actual networking. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.