Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C31C4701A for ; Tue, 23 Aug 2011 22:57:53 +0000 (UTC) Received: (qmail 15302 invoked by uid 500); 23 Aug 2011 22:57:53 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 15177 invoked by uid 500); 23 Aug 2011 22:57:52 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 15169 invoked by uid 99); 23 Aug 2011 22:57:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Aug 2011 22:57:52 +0000 X-ASF-Spam-Status: No, hits=-2000.5 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Aug 2011 22:57:51 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id DA0FCCC08D for ; Tue, 23 Aug 2011 22:57:30 +0000 (UTC) Date: Tue, 23 Aug 2011 22:57:30 +0000 (UTC) From: "jiraposter@reviews.apache.org (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <297084272.7221.1314140250889.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HDFS-395) DFS Scalability: Incremental block reports MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13089843#comment-13089843 ] jiraposter@reviews.apache.org commented on HDFS-395: ---------------------------------------------------- ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/998/ ----------------------------------------------------------- (Updated 2011-08-23 22:55:47.297820) Review request for hadoop-hdfs, Dhruba Borthakur and Hairong Kuang. Changes ------- Adjusted to recent code changes. Fized offerService comment. Summary ------- This patch introduces explicit ACKs sent form datanodes to the namenode in order to minimize the difference between NN blocks info and DN state. This will allow for sending the full block reports less frequently, which in turn will minimize the overhead of processing them, and blocking the namenode for extended time. This addresses bug HDFS-395. https://issues.apache.org/jira/browse/HDFS-395 Diffs (updated) ----- trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/FSDataset.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/FSDatasetAsyncDiskService.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NameNode.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/protocol/BlockCommand.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/protocol/DatanodeProtocol.java 1160919 trunk/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/protocol/ReceivedDeletedBlockInfo.java PRE-CREATION trunk/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/NNThroughputBenchmark.java 1160919 trunk/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/namenode/TestDeadDatanode.java 1160919 Diff: https://reviews.apache.org/r/998/diff Testing ------- Will do JUnit tests. Thanks, Tomasz > DFS Scalability: Incremental block reports > ------------------------------------------ > > Key: HDFS-395 > URL: https://issues.apache.org/jira/browse/HDFS-395 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: data-node, name-node > Reporter: dhruba borthakur > Assignee: Tomasz Nykiel > Attachments: blockReportPeriod.patch, explicitAcks.patch-3, explicitAcks.patch-4, explicitDeleteAcks.patch > > > I have a cluster that has 1800 datanodes. Each datanode has around 50000 blocks and sends a block report to the namenode once every hour. This means that the namenode processes a block report once every 2 seconds. Each block report contains all blocks that the datanode currently hosts. This makes the namenode compare a huge number of blocks that practically remains the same between two consecutive reports. This wastes CPU on the namenode. > The problem becomes worse when the number of datanodes increases. > One proposal is to make succeeding block reports (after a successful send of a full block report) be incremental. This will make the namenode process only those blocks that were added/deleted in the last period. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira