Date: Fri, 13 May 2011 21:52:47 +0000 (UTC)
From: "Tsz Wo (Nicholas), SZE (JIRA)"
To: common-issues@hadoop.apache.org
Reply-To: common-issues@hadoop.apache.org
Message-ID: <966467803.11611.1305323567387.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (HADOOP-1117) DFS Scalability: When the namenode is restarted it consumes 80% CPU

[ 
https://issues.apache.org/jira/browse/HADOOP-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13033330#comment-13033330 ]

Tsz Wo (Nicholas), SZE commented on HADOOP-1117:
------------------------------------------------

Sorry, wrong issue.

> DFS Scalability: When the namenode is restarted it consumes 80% CPU
> -------------------------------------------------------------------
>
>                 Key: HADOOP-1117
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1117
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>            Priority: Blocker
>             Fix For: 0.12.1
>
>         Attachments: CpuPendingTransfer3.patch
>
>
> When the namenode is restarted, the datanodes register and each block is inserted into neededReplication.
> When the namenode exits safemode, it starts processing neededReplication. It picks up a block from
> neededReplication, sees that it already has the required number of replicas, and continues to the next
> block in neededReplication. The blocks remain in neededReplication permanently. The namenode worker thread
> scans this huge list of blocks once every 3 seconds. This consumes plenty of CPU on the namenode.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
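
The scan described in the quoted issue can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the namenode code and not necessarily what CpuPendingTransfer3.patch does: the class NeededReplicationSketch, its fields, and both scan methods are hypothetical names invented for illustration; only the name neededReplication comes from the issue text. The first scan mirrors the reported behaviour (fully replicated blocks are skipped but stay queued, so every pass walks the whole list); the second shows one possible remedy, dropping satisfied blocks as they are seen.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.Iterator;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Toy model of the neededReplication scan; every name here is hypothetical. */
final class NeededReplicationSketch {
    // Block id -> number of replicas reported by datanodes.
    final Map<Long, Integer> liveReplicas = new HashMap<>();
    // Stand-in for the namenode's neededReplication collection.
    final Set<Long> neededReplication = new LinkedHashSet<>();
    final int targetReplication;

    NeededReplicationSketch(int targetReplication) {
        this.targetReplication = targetReplication;
    }

    /** Models datanode registration: every reported block is queued. */
    void reportBlock(long blockId, int replicas) {
        liveReplicas.put(blockId, replicas);
        neededReplication.add(blockId);
    }

    /** Behaviour described in the issue: satisfied blocks are skipped but never removed. */
    List<Long> scanKeepingSatisfied() {
        List<Long> toReplicate = new ArrayList<>();
        for (long id : neededReplication) {
            if (liveReplicas.get(id) >= targetReplication) {
                continue; // already replicated, yet it will be visited again next pass
            }
            toReplicate.add(id);
        }
        return toReplicate;
    }

    /** One possible remedy (an assumption): remove satisfied blocks while scanning. */
    List<Long> scanRemovingSatisfied() {
        List<Long> toReplicate = new ArrayList<>();
        Iterator<Long> it = neededReplication.iterator();
        while (it.hasNext()) {
            long id = it.next();
            if (liveReplicas.get(id) >= targetReplication) {
                it.remove(); // later passes no longer pay for this block
            } else {
                toReplicate.add(id);
            }
        }
        return toReplicate;
    }
}
```

With the first method the queue never shrinks, so a periodic pass costs time proportional to the total number of blocks; with the second, the queue shrinks to the blocks that are actually under-replicated.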