hadoop-common-dev mailing list archives

From "Vinod K V (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-4523) Enhance how memory-intensive user tasks are handled
Date Mon, 10 Nov 2008 10:48:44 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Vinod K V updated HADOOP-4523:

    Attachment: HADOOP-4523-20081110.txt

bq. The latest patch kills only the last task that started if the sum total of all tasks'
memory usage goes beyond the configured limit. Picking up only one task may or may not bring
down the usage to within the configured limits.
Attaching a new patch to address this. TaskMemoryManagerThread now calls {{TaskTracker.findTaskToKill()}}
repeatedly, picking the tasks with the least progress until the total memory usage of the
remaining tasks falls below the TaskTracker's limit, and then kills the selected tasks. Modified
the signature of {{TaskTracker.findTaskToKill()}} to {{TaskTracker.findTaskToKill(List<TaskAttemptID>
tasksToExclude)}} so that tasks already marked for killing are excluded from later calls.
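
The selection loop described above can be sketched roughly as follows. This is only an illustration, not the code in the attached patch: the class {{MemoryKillSketch}}, the {{TaskInfo}} holder, the {{selectTasksToKill}} helper, and the use of plain strings for attempt IDs are all hypothetical stand-ins, and the real {{TaskTracker.findTaskToKill()}} may apply additional criteria beyond least progress.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class MemoryKillSketch {

    /** Hypothetical stand-in for a running task; not Hadoop's real class. */
    static final class TaskInfo {
        final String attemptId;  // assumed: a plain string instead of a task attempt ID object
        final double progress;   // fraction of work done, 0.0 to 1.0
        final long memoryBytes;  // current memory usage of the task

        TaskInfo(String attemptId, double progress, long memoryBytes) {
            this.attemptId = attemptId;
            this.progress = progress;
            this.memoryBytes = memoryBytes;
        }
    }

    /**
     * Returns the task with the least progress that is not listed in
     * tasksToExclude, or null if every task is already excluded.
     */
    static TaskInfo findTaskToKill(List<TaskInfo> running, List<String> tasksToExclude) {
        TaskInfo candidate = null;
        for (TaskInfo t : running) {
            if (tasksToExclude.contains(t.attemptId)) {
                continue; // already marked for killing
            }
            if (candidate == null || t.progress < candidate.progress) {
                candidate = t;
            }
        }
        return candidate;
    }

    /**
     * Marks tasks for killing, least progress first, until the projected
     * total memory usage is no longer above limitBytes.
     */
    static List<String> selectTasksToKill(List<TaskInfo> running, long limitBytes) {
        long total = 0;
        for (TaskInfo t : running) {
            total += t.memoryBytes;
        }
        List<String> marked = new ArrayList<>();
        while (total > limitBytes) {
            TaskInfo victim = findTaskToKill(running, marked);
            if (victim == null) {
                break; // nothing left to mark
            }
            marked.add(victim.attemptId);
            total -= victim.memoryBytes;
        }
        return marked;
    }

    public static void main(String[] args) {
        List<TaskInfo> running = Arrays.asList(
                new TaskInfo("a", 0.9, 400L),
                new TaskInfo("b", 0.1, 300L),
                new TaskInfo("c", 0.5, 500L));
        // Total usage is 1200 against a limit of 700.
        System.out.println(selectTasksToKill(running, 700L)); // prints [b, c]
    }
}
```

Passing the list of already-marked tasks back into each call is what keeps the loop from returning the same least-progress task every time.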

> Enhance how memory-intensive user tasks are handled
> ---------------------------------------------------
>                 Key: HADOOP-4523
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4523
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Vivek Ratan
>            Assignee: Vinod K V
>         Attachments: HADOOP-4523-200811-05.txt, HADOOP-4523-200811-06.txt, HADOOP-4523-20081110.txt
> HADOOP-3581 monitors each Hadoop task to see if its memory usage (which includes usage
of any tasks spawned by it and so on) is within a per-task limit. If the task's memory usage
goes over its limit, the task is killed. This, by itself, is not enough to prevent badly behaving
jobs from bringing down nodes. What is also needed is the ability to make sure that the sum
total of VM usage of all Hadoop tasks does not exceed a certain limit.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
