hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph (JIRA)" <j...@apache.org>
Subject [jira] [Created] (YARN-4331) Killing NodeManager leaves orphaned containers
Date Wed, 04 Nov 2015 18:49:27 GMT
Joseph created YARN-4331:
----------------------------

             Summary: Killing NodeManager leaves orphaned containers
                 Key: YARN-4331
                 URL: https://issues.apache.org/jira/browse/YARN-4331
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager, yarn
    Affects Versions: 2.7.1
            Reporter: Joseph
            Priority: Critical


We are seeing a lot of orphaned containers running in our production clusters.
I tried to simulate this locally on my machine and can replicate the issue by killing nodemanager.
I'm running Yarn 2.7.1 with RM state stored in zookeeper and deploying samza jobs.
Steps:
1. Deploy a job 
2. Issue a kill -9 signal to nodemanager 
3. We should see the AM and its container running without nodemanager
4. AM should die but the container still keeps running
5. Restarting nodemanager brings up new AM and container but leaves the orphaned container
running in the background

This is effectively causing double processing of data.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message