Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 64630 invoked from network); 5 May 2010 22:48:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 5 May 2010 22:48:37 -0000 Received: (qmail 27882 invoked by uid 500); 5 May 2010 22:48:37 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 27848 invoked by uid 500); 5 May 2010 22:48:37 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 27840 invoked by uid 99); 5 May 2010 22:48:37 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 22:48:37 +0000 X-ASF-Spam-Status: No, hits=-1394.4 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 May 2010 22:48:35 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o45MmFti004370 for ; Wed, 5 May 2010 22:48:15 GMT Message-ID: <32814299.35411273099695803.JavaMail.jira@thor> Date: Wed, 5 May 2010 18:48:15 -0400 (EDT) From: "Todd Lipcon (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-1028) INode.getPathNames could split more efficiently In-Reply-To: <1543593839.115381267859247283.JavaMail.jira@brutus.apache.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864568#action_12864568 ] Todd Lipcon commented on HDFS-1028: ----------------------------------- +1 to the body of the patch. Regarding the test change, how long does the test case take now? Can we make a better benchmark that is independent of the unit tests (I assume this change was to show a speed improvement)? I don't think it makes sense to overload the unit tests for the purposes of benchmarking. Personally I'd be satisfied to just have simple timings of loading one of your production fsimages with/without the change. > INode.getPathNames could split more efficiently > ----------------------------------------------- > > Key: HDFS-1028 > URL: https://issues.apache.org/jira/browse/HDFS-1028 > Project: Hadoop HDFS > Issue Type: Improvement > Components: name-node > Reporter: Todd Lipcon > Assignee: Dmytro Molkov > Priority: Minor > Attachments: HDFS-split.patch > > > INode.getPathnames uses String.split(String) which actually uses the full Java regex implementation. Since we're always splitting on a single char, we could implement a faster one like StringUtils.split() (except without the escape character). This takes a significant amount of CPU during FSImage loading so should be a worthwhile speedup. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.