Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 42832 invoked from network); 16 Feb 2009 11:15:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Feb 2009 11:15:36 -0000 Received: (qmail 81836 invoked by uid 500); 16 Feb 2009 11:15:32 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 81797 invoked by uid 500); 16 Feb 2009 11:15:32 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 81786 invoked by uid 99); 16 Feb 2009 11:15:32 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Feb 2009 03:15:32 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 Feb 2009 11:15:21 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D31AD234C4A8 for ; Mon, 16 Feb 2009 03:14:59 -0800 (PST) Message-ID: <680269673.1234782899863.JavaMail.jira@brutus> Date: Mon, 16 Feb 2009 03:14:59 -0800 (PST) From: "Hemanth Yamijala (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-5262) Allow specifying min shares as percentage of cluster In-Reply-To: <166567703.1234599119576.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-5262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12673849#action_12673849 ] Hemanth Yamijala commented on HADOOP-5262: ------------------------------------------ Matei, the fixed number of slots use case makes sense. So, I guess we need both modes. I am considering how this would compare to having a separate config variable for expressing this as a percentage. i.e we could either specify minMaps and minReduces, or minCapacityPercent in the config file. The advantages are: - I think it is a little simpler to manage. If someone misses a percentage symbol without intending to, it could lead to some weird results in the earlier suggestion because it would still be a valid value for the configuration. And like I mentioned, one can add up the percentages and do some simple check that it doesn't exceed 100 and so on. - I don't see a need of having two separate variables in the percentage mode. Like you've mentioned in the description, groups pay for X% of the cluster capacity, and generally not X% maps and Y% reduces. So, setting up two numbers, which almost always are going to be the same doesn't seem to be required. The disadvantage I see is: - One more configuration variable - And what happens if both are specified. We could either have one of the two modes take precendence, or just error out. But either way, the semantics should be decided. In spite of the disadvantages, given how central this, I would favor separate configuration options. Thoughts ? > Allow specifying min shares as percentage of cluster > ---------------------------------------------------- > > Key: HADOOP-5262 > URL: https://issues.apache.org/jira/browse/HADOOP-5262 > Project: Hadoop Core > Issue Type: New Feature > Components: contrib/fair-share > Reporter: Matei Zaharia > Priority: Minor > > Currently the guaranteed shares for pools in the fair scheduler are specified as a number of slots. For organizations where a group pays X% of the cluster and the actual number of nodes in the cluster varies due to failures, expansion, etc over time, it would be useful to support a guaranteed share given as a percentage too. This would just let you write in the config file something like 5% instead of 42. The scheduler would need to recompute what this means in terms of number of slots on every update (probably through some kind of update(ClusterStatus) method in PoolManager). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.