hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sunil G (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-3933) Race condition when calling AbstractYarnScheduler.completedContainer.
Date Mon, 21 Mar 2016 05:34:25 GMT

    [ https://issues.apache.org/jira/browse/YARN-3933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15203746#comment-15203746
] 

Sunil G commented on YARN-3933:
-------------------------------

As per existing patch, new liveContainers check is done before below code {{FS#completedContainerInternal}}.
Pls correct me if am wrong w.r.t FS, {{containerCompleted}} need to be processed for those
containers which are RESERVED too. So with current patch, this scenario may not hit.

{code}
 864     if (rmContainer.getState() == RMContainerState.RESERVED) {
 865       application.unreserve(rmContainer.getReservedPriority(), node);
 866     } else {
{code}


> Race condition when calling AbstractYarnScheduler.completedContainer.
> ---------------------------------------------------------------------
>
>                 Key: YARN-3933
>                 URL: https://issues.apache.org/jira/browse/YARN-3933
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: fairscheduler
>    Affects Versions: 2.6.0, 2.7.0, 2.5.2, 2.7.1
>            Reporter: Lavkesh Lahngir
>            Assignee: Shiwei Guo
>         Attachments: YARN-3933.001.patch, YARN-3933.002.patch, YARN-3933.003.patch
>
>
> In our cluster we are seeing available memory and cores being negative. 
> Initial inspection:
> Scenario no. 1: 
> In capacity scheduler the method allocateContainersToNode() checks if 
> there are excess reservation of containers for an application, and they are no longer
needed then it calls queue.completedContainer() which causes resources being negative. And
they were never assigned in the first place. 
> I am still looking through the code. Can somebody suggest how to simulate excess containers
assignments ?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message