Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 586CC7479 for ; Fri, 15 Jul 2011 01:02:24 +0000 (UTC) Received: (qmail 75316 invoked by uid 500); 15 Jul 2011 01:02:24 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 75140 invoked by uid 500); 15 Jul 2011 01:02:23 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 75131 invoked by uid 99); 15 Jul 2011 01:02:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Jul 2011 01:02:23 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Jul 2011 01:02:22 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id C03EE56FCA for ; Fri, 15 Jul 2011 01:02:01 +0000 (UTC) Date: Fri, 15 Jul 2011 01:02:01 +0000 (UTC) From: "Hairong Kuang (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <985962326.15769.1310691721784.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HDFS-395) DFS Scalability: Incremental block reports MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13065643#comment-13065643 ] Hairong Kuang commented on HDFS-395: ------------------------------------ > NN first deletes it in the block map and adds it to invalidate set for the datanodes. Only blocks belong to a deleted file are removed from the block map. For those blocks, Tom's patch has an optimization that datanode does not send to an ack back. But for an excessive replica, it remains in the block map and excessiveBlockMap until an ack is back. They are the ones that need explicit acknowledgment. > DFS Scalability: Incremental block reports > ------------------------------------------ > > Key: HDFS-395 > URL: https://issues.apache.org/jira/browse/HDFS-395 > Project: Hadoop HDFS > Issue Type: Sub-task > Reporter: dhruba borthakur > Assignee: dhruba borthakur > Attachments: blockReportPeriod.patch, explicitDeleteAcks.patch > > > I have a cluster that has 1800 datanodes. Each datanode has around 50000 blocks and sends a block report to the namenode once every hour. This means that the namenode processes a block report once every 2 seconds. Each block report contains all blocks that the datanode currently hosts. This makes the namenode compare a huge number of blocks that practically remains the same between two consecutive reports. This wastes CPU on the namenode. > The problem becomes worse when the number of datanodes increases. > One proposal is to make succeeding block reports (after a successful send of a full block report) be incremental. This will make the namenode process only those blocks that were added/deleted in the last period. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira