hadoop-yarn-issues mailing list archives

From "Rohith Sharma K S (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-5197) RM leaks containers if running container disappears from node update
Date Fri, 10 Jun 2016 04:18:21 GMT

    [ https://issues.apache.org/jira/browse/YARN-5197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15323832#comment-15323832 ]

Rohith Sharma K S commented on YARN-5197:
-----------------------------------------

Overall, the patch looks good to me.
One nit: in method {{findLostContainers}}, before adding a container to nodeContainers, can the
addition be guarded by a check that its execution type is GUARANTEED?
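
A minimal sketch of what such a guard might look like, for illustration only. The patch itself is not quoted in this message, so everything here is an assumption: the ExecutionType enum is a local stand-in with just two values, and ReportedContainer and guaranteedIds are hypothetical names rather than code from YARN-5197.

```java
import java.util.ArrayList;
import java.util.List;

class ExecutionTypeGuardSketch {
    // Local stand-in for the container execution type (assumption, not the YARN class).
    enum ExecutionType { GUARANTEED, OPPORTUNISTIC }

    // Hypothetical minimal view of one container reported in a node heartbeat.
    static final class ReportedContainer {
        final String id;
        final ExecutionType type;

        ReportedContainer(String id, ExecutionType type) {
            this.id = id;
            this.type = type;
        }
    }

    // Collects only the ids of GUARANTEED containers; all other types are skipped.
    static List<String> guaranteedIds(List<ReportedContainer> reported) {
        List<String> ids = new ArrayList<>();
        for (ReportedContainer c : reported) {
            if (c.type == ExecutionType.GUARANTEED) { // the guard suggested in the nit
                ids.add(c.id);
            }
        }
        return ids;
    }
}
```

With a guard of this shape, containers of other execution types would never enter the set used for lost-container detection.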

> RM leaks containers if running container disappears from node update
> --------------------------------------------------------------------
>
>                 Key: YARN-5197
>                 URL: https://issues.apache.org/jira/browse/YARN-5197
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.7.2, 2.6.4
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>         Attachments: YARN-5197.001.patch, YARN-5197.002.patch
>
>
> Once a node reports a container running in a status update, the corresponding RMNodeImpl
> will track the container in its launchedContainers map.  If the node somehow misses sending
> the completed container status to the RM and the container simply disappears from subsequent
> heartbeats, the container will leak in launchedContainers forever and the container completion
> event will not be sent to the scheduler.
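
The leak described above, and one way it could be detected, can be sketched in a simplified model. This is not the RMNodeImpl code or the attached patch: container ids are plain strings, and LostContainerSketch, onHeartbeat, and trackedCount are hypothetical names chosen for illustration. The idea is that any tracked container that is neither reported as running nor reported as completed in a heartbeat is treated as lost and dropped from tracking, so that a completion could be signalled for it.

```java
import java.util.HashSet;
import java.util.Set;

class LostContainerSketch {
    // Containers believed to be running on the node (stand-in for launchedContainers).
    private final Set<String> launched = new HashSet<>();

    /**
     * Applies one heartbeat and returns the containers that were tracked as
     * running but vanished without a completion status.
     */
    Set<String> onHeartbeat(Set<String> running, Set<String> completed) {
        launched.addAll(running);      // start tracking newly reported containers
        launched.removeAll(completed); // normal path: completion was reported

        // Anything still tracked but absent from this heartbeat has disappeared.
        Set<String> lost = new HashSet<>(launched);
        lost.removeAll(running);

        launched.removeAll(lost);      // stop tracking so the entries do not leak
        return lost;
    }

    int trackedCount() {
        return launched.size();
    }
}
```

Without the set-difference step, a container that drops out of the heartbeats would stay in the tracked set indefinitely, which is the behavior the issue reports.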



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

