Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 66872 invoked from network); 28 Dec 2010 19:17:08 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 28 Dec 2010 19:17:08 -0000 Received: (qmail 86299 invoked by uid 500); 28 Dec 2010 19:17:08 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 86262 invoked by uid 500); 28 Dec 2010 19:17:08 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 86254 invoked by uid 99); 28 Dec 2010 19:17:07 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Dec 2010 19:17:07 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Dec 2010 19:17:07 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oBSJGl6U028601 for ; Tue, 28 Dec 2010 19:16:47 GMT Message-ID: <4581511.48151293563807438.JavaMail.jira@thor> Date: Tue, 28 Dec 2010 14:16:47 -0500 (EST) From: "dhruba borthakur (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Updated: (HDFS-1391) Exiting safemode takes a long time when there are lots of blocks in the HDFS MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dhruba borthakur updated HDFS-1391: ----------------------------------- Status: Patch Available (was: Open) > Exiting safemode takes a long time when there are lots of blocks in the HDFS > ---------------------------------------------------------------------------- > > Key: HDFS-1391 > URL: https://issues.apache.org/jira/browse/HDFS-1391 > Project: Hadoop HDFS > Issue Type: Bug > Components: name-node > Reporter: dhruba borthakur > Assignee: dhruba borthakur > Attachments: excessReplicas.1_trunk.txt, excessReplicas2.txt > > > When the namenode decides to exit safemode, it acquires the FSNamesystem lock and then iterates over all blocks in the blocksmap to determine if any block has any excess replicas. This call takes upwards of 5 minutes on a cluster that has 100 million blocks. This delays namenode restart to a good extent. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.