hadoop-yarn-dev mailing list archives

From "Jian He (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-2825) Container leak on NM
Date Fri, 07 Nov 2014 00:59:33 GMT
Jian He created YARN-2825:

             Summary: Container leak on NM
                 Key: YARN-2825
                 URL: https://issues.apache.org/jira/browse/YARN-2825
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Jian He
            Assignee: Jian He
            Priority: Critical

Caused by YARN-1372. Thanks [~vinodkv] for pointing this out.

The problem is that in YARN-1372 we changed the behavior to remove containers from NMContext
only after the containers are acknowledged by the AM. But in the {{NodeStatusUpdaterImpl#removeCompletedContainersFromContext}}
call, we didn't check whether the container has actually completed. If the container
is still running, we shouldn't remove it from the context.
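
A minimal sketch of the missing check, for illustration only. This is not the actual NodeStatusUpdaterImpl code or the eventual patch: the class {{ContainerContextSketch}}, the {{State}} enum, the string container IDs, and the plain map standing in for the NM context are all hypothetical simplifications. The idea is just that an acknowledged container ID is dropped from the context only if the container is known and has completed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified stand-in for the NM-side bookkeeping; not Hadoop code.
class ContainerContextSketch {

    // Assumed two-state model; the real NM tracks more container states.
    enum State { RUNNING, COMPLETE }

    /**
     * Removes acknowledged containers from the context, but only those
     * that have actually completed. Returns the IDs that were removed.
     */
    static List<String> removeCompletedContainersFromContext(
            Map<String, State> context, List<String> acknowledgedIds) {
        List<String> removed = new ArrayList<>();
        for (String id : acknowledgedIds) {
            State state = context.get(id);
            // Skip unknown containers and containers that are still running:
            // removing a running container here is the leak described above.
            if (state == State.COMPLETE) {
                context.remove(id);
                removed.add(id);
            }
        }
        return removed;
    }
}
```

With a context holding one COMPLETE and one RUNNING container, acknowledging both IDs would remove only the completed one; the running container stays in the context so the NM can keep tracking it.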

This message was sent by Atlassian JIRA
