Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 79497 invoked from network); 28 Feb 2008 16:08:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 28 Feb 2008 16:08:49 -0000 Received: (qmail 52676 invoked by uid 500); 28 Feb 2008 16:08:42 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 52653 invoked by uid 500); 28 Feb 2008 16:08:42 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 52644 invoked by uid 99); 28 Feb 2008 16:08:42 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Feb 2008 08:08:42 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Feb 2008 16:08:16 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 88C70234C010 for ; Thu, 28 Feb 2008 08:07:51 -0800 (PST) Message-ID: <1507094410.1204214871545.JavaMail.jira@brutus> Date: Thu, 28 Feb 2008 08:07:51 -0800 (PST) From: "Devaraj Das (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Updated: (HADOOP-1985) Abstract node to switch mapping into a topology service class used by namenode and jobtracker MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj Das updated HADOOP-1985: -------------------------------- Attachment: 1985.v25.patch The last patch went out-of-sync with the trunk. The update to the last patch is very trivial though, and I validated the hudson tests on my local machine. > Abstract node to switch mapping into a topology service class used by namenode and jobtracker > --------------------------------------------------------------------------------------------- > > Key: HADOOP-1985 > URL: https://issues.apache.org/jira/browse/HADOOP-1985 > Project: Hadoop Core > Issue Type: New Feature > Components: dfs, mapred > Reporter: eric baldeschwieler > Assignee: Devaraj Das > Fix For: 0.17.0 > > Attachments: 1985.new.patch, 1985.v1.patch, 1985.v10.patch, 1985.v11.patch, 1985.v19.patch, 1985.v2.patch, 1985.v20.patch, 1985.v23.patch, 1985.v24.patch, 1985.v25.patch, 1985.v3.patch, 1985.v4.patch, 1985.v5.patch, 1985.v6.patch, 1985.v9.patch, jobinprogress.patch > > > In order to implement switch locality in MapReduce, we need to have switch location in both the namenode and job tracker. Currently the namenode asks the data nodes for this info and they run a local script to answer this question. In our environment and others that I know of there is no reason to push this to each node. It is easier to maintain a centralized script that maps node DNS names to switch strings. > I propose that we build a new class that caches known DNS name to switch mappings and invokes a loadable class or a configurable system call to resolve unknown DNS to switch mappings. We can then add this to the namenode to support the current block to switch mapping needs and simplify the data nodes. We can also add this same callout to the job tracker and then implement rack locality logic there without needing to chane the filesystem API or the split planning API. > Not only is this the least intrusive path to building racklocal MR I can ID, it is also future compatible to future infrastructures that may derive topology on the fly, etc, etc... -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.