Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 70754 invoked from network); 7 Jan 2011 09:51:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 7 Jan 2011 09:51:09 -0000 Received: (qmail 80218 invoked by uid 500); 7 Jan 2011 09:51:09 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 80110 invoked by uid 500); 7 Jan 2011 09:51:09 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 79916 invoked by uid 99); 7 Jan 2011 09:51:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Jan 2011 09:51:08 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Jan 2011 09:51:08 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id p079olmM011085 for ; Fri, 7 Jan 2011 09:50:47 GMT Message-ID: <13963014.208481294393847772.JavaMail.jira@thor> Date: Fri, 7 Jan 2011 04:50:47 -0500 (EST) From: "Scott Carey (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-1547) Improve decommission mechanism In-Reply-To: <662967.224821292884142120.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12978710#action_12978710 ] Scott Carey commented on HDFS-1547: ----------------------------------- I like Todd's proposal to have only one file, that lists each node at most once, and do not see any explanation why it won't work. A node has only one state from the administrator POV, and what should be shown in the UI (dead, decomission in progress, etc) can be derived from that. Why have 3 files when one will do? Its only more confusing. Yes, the current two file format has issues because the meaning is overloaded or the names are bas. But a single file with a format like Todd suggests seems like it would work. Possible format: {noformat} node1=active node2=decommission node3=exclude {noformat} When an administrator wants to decommission a node, the part after the = in the file for that node is changed from active to decommission. Nodes in the decommission state are allowed to talk to the NN and register with it, but will shut down after successful decommission. Nodes marked exclude are not allowed to talk to the NN. Nodes marked active are tracked and compared to what is regisered (along with decommission marked nodes) to identify dead nodes. In short, all three files in this proposal could be combined into one. > Improve decommission mechanism > ------------------------------ > > Key: HDFS-1547 > URL: https://issues.apache.org/jira/browse/HDFS-1547 > Project: Hadoop HDFS > Issue Type: Improvement > Components: name-node > Affects Versions: 0.23.0 > Reporter: Suresh Srinivas > Assignee: Suresh Srinivas > Fix For: 0.23.0 > > > Current decommission mechanism driven using exclude file has several issues. This bug proposes some changes in the mechanism for better manageability. See the proposal in the next comment for more details. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.