Return-Path:
X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org
Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org
Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 346CF101D8 for ; Sat, 15 Jun 2013 00:51:22 +0000 (UTC)
Received: (qmail 31503 invoked by uid 500); 15 Jun 2013 00:51:21 -0000
Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org
Received: (qmail 31401 invoked by uid 500); 15 Jun 2013 00:51:21 -0000
Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: hdfs-issues@hadoop.apache.org
Delivered-To: mailing list hdfs-issues@hadoop.apache.org
Received: (qmail 31103 invoked by uid 99); 15 Jun 2013 00:51:21 -0000
Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 15 Jun 2013 00:51:21 +0000
Date: Sat, 15 Jun 2013 00:51:20 +0000 (UTC)
From: "Chuan Liu (JIRA)"
To: hdfs-issues@hadoop.apache.org
Message-ID:
In-Reply-To:
References:
Subject: [jira] [Reopened] (HDFS-4521) invalid network toploogies should not be cached
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

[ https://issues.apache.org/jira/browse/HDFS-4521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chuan Liu reopened HDFS-4521:
-----------------------------

Reopening this issue. We have a similar problem in Hadoop 1.0; more information can be found in HADOOP-9633. The code in 1.0 is quite different from 2.0, so a new patch may be needed to address the problem in branch-1.

Thanks!

> invalid network toploogies should not be cached
> -----------------------------------------------
>
> Key: HDFS-4521
> URL: https://issues.apache.org/jira/browse/HDFS-4521
> Project: Hadoop HDFS
> Issue Type: Improvement
> Affects Versions: 2.1.0-beta
> Reporter: Colin Patrick McCabe
> Assignee: Colin Patrick McCabe
> Priority: Minor
> Fix For: 2.1.0-beta
>
> Attachments: HDFS-4521.001.patch, HDFS-4521.002.patch, HDFS-4521.005.patch, HDFS-4521.006.patch, HDFS-4521.008.patch
>
>
> When the network topology is invalid, the DataNode refuses to start with a message such as this:
> {quote}
> org.apache.hadoop.hdfs.server.protocol.DatanodeProtocol.registerDatanode from 172.29.122.23:55886: error:
> org.apache.hadoop.net.NetworkTopology$InvalidTopologyException: Invalid network topology. You cannot have a rack and a non-rack node at the same level of the network topology.
> {quote}
> This is expected if you specify a topology file or script which puts leaf nodes at two different depths. However, one problem we have now is that this incorrect topology is cached forever. Once the NameNode sees it, this DataNode can never be added to the cluster, since this exception will be rethrown each time. The NameNode will not check to see if the topology file or script has changed. We should clear the topology mappings when there is an InvalidTopologyException, to prevent this problem.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
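
For readers following the description quoted above, the caching behaviour can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the Hadoop code and not any of the attached patches: `TopologyCacheSketch`, its `register` method, the `clearOnInvalid` flag, and the nested `InvalidTopologyException` are all hypothetical stand-ins, and a plain `Map` lookup plays the role of the topology file or script.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class TopologyCacheSketch {
    /** Hypothetical stand-in; NOT org.apache.hadoop.net.NetworkTopology's exception. */
    static class InvalidTopologyException extends RuntimeException {
        InvalidTopologyException(String msg) { super(msg); }
    }

    private final Map<String, String> cache = new HashMap<>();      // host -> resolved rack path
    private final Map<String, String> registered = new HashMap<>(); // hosts accepted so far
    private final Function<String, String> resolver;                // plays the topology script
    private final boolean clearOnInvalid;                           // true = proposed behaviour

    TopologyCacheSketch(Function<String, String> resolver, boolean clearOnInvalid) {
        this.resolver = resolver;
        this.clearOnInvalid = clearOnInvalid;
    }

    /** Depth of a rack path: "/rack1" is 1, "/dc1/rack1" is 2. */
    private static int depth(String path) {
        return path.split("/").length - 1;
    }

    /** Registers a host, rejecting it if its rack depth differs from existing nodes. */
    void register(String host) {
        // The resolver is consulted only when the host is not cached yet.
        String rack = cache.computeIfAbsent(host, resolver);
        for (String other : registered.values()) {
            if (depth(other) != depth(rack)) {
                if (clearOnInvalid) {
                    cache.clear(); // drop mappings so the next attempt re-resolves
                }
                throw new InvalidTopologyException(
                    "leaf nodes at different depths: " + other + " vs " + rack);
            }
        }
        registered.put(host, rack);
    }

    /** Returns true if dn2 is still rejected after the "script" has been corrected. */
    static boolean stillRejectedAfterFix(boolean clearOnInvalid) {
        Map<String, String> script = new HashMap<>();
        script.put("dn1", "/rack1");
        script.put("dn2", "/dc1/rack2"); // misconfigured: one level too deep
        TopologyCacheSketch nn = new TopologyCacheSketch(script::get, clearOnInvalid);
        nn.register("dn1");
        try {
            nn.register("dn2");
        } catch (InvalidTopologyException expected) {
            // first attempt fails in both variants
        }
        script.put("dn2", "/rack2"); // operator fixes the topology script
        try {
            nn.register("dn2");
            return false;
        } catch (InvalidTopologyException stale) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("without clearing: " + stillRejectedAfterFix(false));
        System.out.println("with clearing:    " + stillRejectedAfterFix(true));
    }
}
```

In this model, the variant that keeps the cache keeps rejecting `dn2` even after the mapping is corrected, because the stale entry is never re-resolved; the variant that clears the cache on failure picks up the corrected mapping on the next attempt. Clearing the whole cache rather than a single entry is a simplification chosen here to mirror the "clear the topology mappings" wording.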