Message-ID: <1636146451.1226314124435.JavaMail.jira@brutus>
Date: Mon, 10 Nov 2008 02:48:44 -0800 (PST)
From: "Vinod K V (JIRA)"
To: core-dev@hadoop.apache.org
Subject: [jira] Updated: (HADOOP-4523) Enhance how memory-intensive user tasks are handled
In-Reply-To: <1134360276.1225090124414.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/HADOOP-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod K V updated HADOOP-4523:
------------------------------

    Attachment: HADOOP-4523-20081110.txt

bq.
The latest patch kills only the last task that started if the sum total of all tasks' memory usage goes beyond the configured limit. Picking only one task may or may not bring the usage down to within the configured limit.

Attaching a new patch to address this. TaskMemoryManagerThread now calls {{TaskTracker.findTaskToKill()}} repeatedly to select the tasks with the least progress until the total memory usage of the remaining tasks falls below the TT's limit, and then kills the selected tasks. Modified the signature of {{TaskTracker.findTaskToKill()}} to {{TaskTracker.findTaskToKill(List tasksToExclude)}} so that tasks already marked for killing can be excluded.

> Enhance how memory-intensive user tasks are handled
> ---------------------------------------------------
>
>                 Key: HADOOP-4523
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4523
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Vivek Ratan
>            Assignee: Vinod K V
>         Attachments: HADOOP-4523-200811-05.txt, HADOOP-4523-200811-06.txt, HADOOP-4523-20081110.txt
>
>
> HADOOP-3581 monitors each Hadoop task to see if its memory usage (which includes usage of any tasks spawned by it and so on) is within a per-task limit. If the task's memory usage goes over its limit, the task is killed. This, by itself, is not enough to prevent badly behaving jobs from bringing down nodes. What is also needed is the ability to make sure that the sum total of VM usage of all Hadoop tasks does not exceed a certain limit.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
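
For readers following the thread, the selection loop described in the comment above can be roughly sketched as below. This is only an illustration under stated assumptions, not the code in the attached patch: {{MemoryKillSketch}}, {{Task}}, its fields, and {{selectTasksToKill()}} are hypothetical names, and the body of {{findTaskToKill()}} here is a guess at "least progress first" rather than the TaskTracker's actual logic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class MemoryKillSketch {

    /** Hypothetical stand-in for a running task; not Hadoop's real task class. */
    static final class Task {
        final String id;
        final long memoryBytes; // current memory usage of the task and its children
        final double progress;  // 0.0 (just started) to 1.0 (done)

        Task(String id, long memoryBytes, double progress) {
            this.id = id;
            this.memoryBytes = memoryBytes;
            this.progress = progress;
        }
    }

    /**
     * Assumed behaviour: return the task with the least progress that is not
     * already in tasksToExclude, or null if every task is excluded.
     */
    static Task findTaskToKill(List<Task> running, List<Task> tasksToExclude) {
        Task candidate = null;
        for (Task t : running) {
            if (tasksToExclude.contains(t)) {
                continue; // already marked for killing
            }
            if (candidate == null || t.progress < candidate.progress) {
                candidate = t;
            }
        }
        return candidate;
    }

    /** Keep picking victims until the projected total usage is within the limit. */
    static List<Task> selectTasksToKill(List<Task> running, long limitBytes) {
        long total = 0;
        for (Task t : running) {
            total += t.memoryBytes;
        }
        List<Task> toKill = new ArrayList<Task>();
        while (total > limitBytes) {
            Task victim = findTaskToKill(running, toKill);
            if (victim == null) {
                break; // nothing left to pick
            }
            toKill.add(victim);
            total -= victim.memoryBytes; // usage expected once the victim is gone
        }
        return toKill;
    }

    public static void main(String[] args) {
        List<Task> running = Arrays.asList(
                new Task("a", 400, 0.9),
                new Task("b", 300, 0.1),
                new Task("c", 500, 0.5));
        // Total usage is 1200 against a limit of 600, so more than one task must go.
        for (Task t : selectTasksToKill(running, 600)) {
            System.out.println(t.id);
        }
    }
}
```

In this example, killing only the least-progressed task would leave usage at 900, still over the limit of 600, which is why the loop keeps selecting and passes the already-selected tasks as the exclusion list.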