Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 50149 invoked from network); 4 Mar 2009 19:42:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 4 Mar 2009 19:42:20 -0000 Received: (qmail 89085 invoked by uid 500); 4 Mar 2009 19:42:17 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 89045 invoked by uid 500); 4 Mar 2009 19:42:17 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 89030 invoked by uid 99); 4 Mar 2009 19:42:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Mar 2009 11:42:17 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Mar 2009 19:42:16 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 6BD72234C4AB for ; Wed, 4 Mar 2009 11:41:56 -0800 (PST) Message-ID: <1934172222.1236195716440.JavaMail.jira@brutus> Date: Wed, 4 Mar 2009 11:41:56 -0800 (PST) From: "Hairong Kuang (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-5399) Simulated datanodes crashes NameNode In-Reply-To: <2058290168.1236194636321.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-5399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678873#action_12678873 ] Hairong Kuang commented on HADOOP-5399: --------------------------------------- It turns out this bug is caused by HADOOP-5384. Simulated datanodes send block reports to NN that contains a block with an invalid generation stamp, GenerationStamp.WILDCARD_STAMP. NN finds out the block does not belong to any file so marks it to be invalid. Then ReplicationMonitor schedules the block to be deleted on its datanode by adding it to the invalidateSet of its DatanodeDescriptor, which is a TreeSet. So adding the block to the invalidateSet triggers the call to Block#compareTo that throws IllegalStateExceptionon on wild card generation stamp. ReplicationMonitor calls System.exit to shutdown NN when catching a RuntimeException. So NN gets crashed. A simple solution to the problem is that block report processing should filter blocks with wild card generation stamp. > Simulated datanodes crashes NameNode > ------------------------------------ > > Key: HADOOP-5399 > URL: https://issues.apache.org/jira/browse/HADOOP-5399 > Project: Hadoop Core > Issue Type: Bug > Components: dfs > Reporter: Hairong Kuang > Fix For: 0.21.0 > > > INFO org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.processReport: block blk_448_1 on > XX size 10 does not belong to any file. > INFO org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.addToInvalidates: blk_448 is added > to invalidSet of XX > INFO org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.processReport: block blk_447_1 on > XX size 10 does not belong to any file. > INFO org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.addToInvalidates: blk_447 is added > to invalidSet of XX > WARN org.apache.hadoop.hdfs.server.namenode.FSNamesystem: ReplicationMonitor thread received > Runtime exception. java.lang.IllegalStateException: generationStamp (=1) == GenerationStamp.WILDCARD_STAMP > INFO org.apache.hadoop.hdfs.server.namenode.NameNode: SHUTDOWN_MSG: > /************************************************************ > SHUTDOWN_MSG: Shutting down NameNode at YY > ************************************************************/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.