Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 183B0200C53 for ; Tue, 11 Apr 2017 23:16:47 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 16D98160B9E; Tue, 11 Apr 2017 21:16:47 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5CED7160B7D for ; Tue, 11 Apr 2017 23:16:46 +0200 (CEST) Received: (qmail 72338 invoked by uid 500); 11 Apr 2017 21:16:45 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 72327 invoked by uid 99); 11 Apr 2017 21:16:45 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Apr 2017 21:16:45 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id EDFB91A04C4 for ; Tue, 11 Apr 2017 21:16:44 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id PanHBOhaZXz7 for ; Tue, 11 Apr 2017 21:16:44 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 6842260D87 for ; Tue, 11 Apr 2017 21:16:43 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 80B7AE0D3C for ; Tue, 11 Apr 2017 21:16:42 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id D7EA624067 for ; Tue, 11 Apr 2017 21:16:41 +0000 (UTC) Date: Tue, 11 Apr 2017 21:16:41 +0000 (UTC) From: "Chen Liang (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HDFS-11530) Use HDFS specific network topology to choose datanode in BlockPlacementPolicyDefault MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 11 Apr 2017 21:16:47 -0000 [ https://issues.apache.org/jira/browse/HDFS-11530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen Liang updated HDFS-11530: ------------------------------ Attachment: HDFS-11530.008.patch Attaching v008 patch to fix the issue. [~linyiqun] [~arpitagarwal] would you please take a look on this one some time? As v008 patch made a few small changes but they kind of break some earlier assumptions. More specifically, {{TestBlockManager.testSafeModeIBR}} fail seems to be caused by that the {{add(Node node)}} was called more than once on the same node, with different storage type info. This violates the earlier assumption that nodes only get called by add once, or at least always the same storage info. The reason seems that this tests is simulating datanode fails and this is why nodes get added twice. This seems to be a valid case so I added {{DFSTopologyNodeImpl.updateExistingDatanode}} for this. On the other hand, {{TestBlockStatsMXBean.testStorageTypeStatsWhenStorageFailed}} fail seems to be caused by that the {{excludedNodes}} set does not have to be {{DatanodeDescriptor}}, but can also be {{DatanodeInfo}} instead. (Although it's a bit suspicious to me why this is the only test that failed due to this). Assuming this is valid, I modified {{DFSNetworkTopology.chooseRandomWithStorageType}} to do some more check. I do appreciate any comments. Basically I made the changes here assuming the tests are doing the right thing, but it might very well be that the tests themselves should be modified instead. > Use HDFS specific network topology to choose datanode in BlockPlacementPolicyDefault > ------------------------------------------------------------------------------------ > > Key: HDFS-11530 > URL: https://issues.apache.org/jira/browse/HDFS-11530 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: namenode > Affects Versions: 3.0.0-alpha2 > Reporter: Yiqun Lin > Assignee: Yiqun Lin > Attachments: HDFS-11530.001.patch, HDFS-11530.002.patch, HDFS-11530.003.patch, HDFS-11530.004.patch, HDFS-11530.005.patch, HDFS-11530.006.patch, HDFS-11530.007.patch, HDFS-11530.008.patch > > > The work for {{chooseRandomWithStorageType}} has been merged in HDFS-11482. But this method is contained in new topology {{DFSNetworkTopology}} which is specified for HDFS. We should update this and let {{BlockPlacementPolicyDefault}} use the new way since the original way is inefficient. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org