Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 188F0927E for ; Sat, 28 Jan 2012 01:35:21 +0000 (UTC) Received: (qmail 70085 invoked by uid 500); 28 Jan 2012 01:35:20 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 70012 invoked by uid 500); 28 Jan 2012 01:35:20 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 70004 invoked by uid 99); 28 Jan 2012 01:35:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 28 Jan 2012 01:35:19 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 28 Jan 2012 01:35:17 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 338801689AE for ; Sat, 28 Jan 2012 01:34:35 +0000 (UTC) Date: Sat, 28 Jan 2012 01:34:35 +0000 (UTC) From: "Uma Maheswara Rao G (Commented) (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1013225967.3161.1327714475647.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1578386755.428.1327678571140.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HDFS-2851) After Balancer runs, usedSpace is not balancing correctly. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-2851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195323#comment-13195323 ] Uma Maheswara Rao G commented on HDFS-2851: ------------------------------------------- Hi Eli, This particular issue is happening only in branch. This case works fine in trunk. As for the initial look, There are 2 DNs(DN1,DN2) registered with NN initial block report also sent. After NN transitioned to active all blocks will be marked as stale until next block report comes from this DNs. One new DN (DN3) added , this particular DN registered with active NN sucessfully. When we run the balancer, it needs to move some blocks here and there to balance the cluster. Some blocks came to old DNs, and needs to process OverReplicated blocks as well. I think there is no immediate next block report after transitioned to active (this point need to confirm , whether we are triggering the block report immediately after transitioned to active or not), So the blocks was still in stale mode. Processing overReplicated blocks are getting postponed due to this reason. Since this nodes not processed OverReplicated blocks , used space is little high than expected. [ usedSpace (current: 390, expected: 300)] I just reduced the block report interval to very less (10s), then this particular case is passing. Thanks Uma > After Balancer runs, usedSpace is not balancing correctly. > ---------------------------------------------------------- > > Key: HDFS-2851 > URL: https://issues.apache.org/jira/browse/HDFS-2851 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: balancer, data-node, ha, name-node > Affects Versions: HA branch (HDFS-1623) > Reporter: Uma Maheswara Rao G > Assignee: Uma Maheswara Rao G > > After Balancer runs, usedSpace is not balancing correctly. > {code} > java.util.concurrent.TimeoutException: Cluster failed to reached expected values of totalSpace (current: 1500, expected: 1500), or usedSpace (current: 390, expected: 300), in more than 20000 msec. > at org.apache.hadoop.hdfs.server.balancer.TestBalancer.waitForHeartBeat(TestBalancer.java:233) > at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithHANameNodes.testBalancerWithHANameNodes(TestBalancerWithHANameNodes.java:99) > {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira