Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AB25E922C for ; Mon, 21 May 2012 18:13:41 +0000 (UTC) Received: (qmail 33242 invoked by uid 500); 21 May 2012 18:13:41 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 33181 invoked by uid 500); 21 May 2012 18:13:41 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 33171 invoked by uid 99); 21 May 2012 18:13:41 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 May 2012 18:13:41 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 5CE3B14281C for ; Mon, 21 May 2012 18:13:41 +0000 (UTC) Date: Mon, 21 May 2012 18:13:41 +0000 (UTC) From: "Colin Patrick McCabe (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1496268984.4094.1337624021383.JavaMail.jiratomcat@issues-vm> In-Reply-To: <661114597.2029.1329871488750.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (HDFS-2982) Startup performance suffers when there are many edit log segments MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-2982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Colin Patrick McCabe updated HDFS-2982: --------------------------------------- Attachment: HDFS-2982.006.patch > Startup performance suffers when there are many edit log segments > ----------------------------------------------------------------- > > Key: HDFS-2982 > URL: https://issues.apache.org/jira/browse/HDFS-2982 > Project: Hadoop HDFS > Issue Type: Bug > Components: name-node > Affects Versions: 2.0.0 > Reporter: Todd Lipcon > Assignee: Colin Patrick McCabe > Priority: Critical > Attachments: HDFS-2982.001.patch, HDFS-2982.002.patch, HDFS-2982.003.patch, HDFS-2982.004.patch, HDFS-2982.005.patch, HDFS-2982.006.patch > > > For every one of the edit log segments, it seems like we are calling listFiles on the edit log directory inside of {{findMaxTransaction}}. This is killing performance, especially when there are many log segments and the directory is stored on NFS. It is taking several minutes to start up the NN when there are several thousand log segments present. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira