Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 63739 invoked from network); 12 Jan 2011 21:35:12 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Jan 2011 21:35:12 -0000 Received: (qmail 16789 invoked by uid 500); 12 Jan 2011 21:35:12 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 16400 invoked by uid 500); 12 Jan 2011 21:35:12 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 16236 invoked by uid 99); 12 Jan 2011 21:35:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Jan 2011 21:35:11 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Jan 2011 21:35:09 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id p0CLYmH8007972 for ; Wed, 12 Jan 2011 21:34:48 GMT Message-ID: <554970.324061294868088127.JavaMail.jira@thor> Date: Wed, 12 Jan 2011 16:34:48 -0500 (EST) From: "Suresh Srinivas (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Commented: (HDFS-1547) Improve decommission mechanism In-Reply-To: <662967.224821292884142120.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12980939#action_12980939 ] Suresh Srinivas commented on HDFS-1547: --------------------------------------- Thinking a bit more about the problem, I think there could be issues in some cases: Consider a cluster with N nodes, L live and D decommissioned with transceiver load on each datanode {X1, X2, ... XN}. A datanode is not good for write when Xi > 2 * X /(L+D) That means when D > L, a lot of the nodes will be not eligible for writes. The remainining that are good, will have to take write load and will push X higher. Also read traffic that is not subject to the above condition will push X higher. In the worst case scenarios, if the load on every node is equal to X and write load dominates reads, then very few or no nodes are good for writes! Some observations: # This problem is severe as D gets closer to and more than N/2. # Doing such a decommission of large number datanodes has several issues: #* It reduces cluster available free storage for writes. Writes could simply fail because of no free storage. The decommissioning may not complete, because of lack of free storage. #* Further when this happens, the number nodes available for writes is significantly reduced (as writes are not done to D nodes). #* Note this problem also exists when decommissioning is in progress for large number of nodes. Given this I am leaning towards not handling this case. > Improve decommission mechanism > ------------------------------ > > Key: HDFS-1547 > URL: https://issues.apache.org/jira/browse/HDFS-1547 > Project: Hadoop HDFS > Issue Type: Improvement > Components: name-node > Affects Versions: 0.23.0 > Reporter: Suresh Srinivas > Assignee: Suresh Srinivas > Fix For: 0.23.0 > > Attachments: HDFS-1547.1.patch, HDFS-1547.patch > > > Current decommission mechanism driven using exclude file has several issues. This bug proposes some changes in the mechanism for better manageability. See the proposal in the next comment for more details. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.