Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 791C11090B for ; Mon, 2 Mar 2015 10:19:21 +0000 (UTC) Received: (qmail 34519 invoked by uid 500); 2 Mar 2015 10:19:05 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 34467 invoked by uid 500); 2 Mar 2015 10:19:05 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 34172 invoked by uid 99); 2 Mar 2015 10:19:05 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Mar 2015 10:19:05 +0000 Date: Mon, 2 Mar 2015 10:19:05 +0000 (UTC) From: "Beckham007 (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-3080) The DockerContainerExecutor could not write the right pid to container pidFile MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/YARN-3080?page=3Dcom.atlassian.= jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D14342= 995#comment-14342995 ]=20 Beckham007 commented on YARN-3080: ---------------------------------- We can use "yarn containerId" as docker "container name". So, docker kill "= yarn containerId" will correctly kill the docker container. > The DockerContainerExecutor could not write the right pid to container pi= dFile > -------------------------------------------------------------------------= ----- > > Key: YARN-3080 > URL: https://issues.apache.org/jira/browse/YARN-3080 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager > Affects Versions: 2.6.0 > Reporter: Beckham007 > Assignee: Abin Shahab > Attachments: YARN-3080.patch, YARN-3080.patch, YARN-3080.patch, Y= ARN-3080.patch > > > The docker_container_executor_session.sh is like this: > {quote} > #!/usr/bin/env bash > echo `/usr/bin/docker inspect --format {{.State.Pid}} container_142172368= 5222_0008_01_000002` > /data/nm_restart/hadoop-2.4.1/data/yarn/local/nmPriv= ate/application_1421723685222_0008/container_1421723685222_0008_01_000002/c= ontainer_1421723685222_0008_01_000002.pid.tmp > /bin/mv -f /data/nm_restart/hadoop-2.4.1/data/yarn/local/nmPrivate/applic= ation_1421723685222_0008/container_1421723685222_0008_01_000002/container_1= 421723685222_0008_01_000002.pid.tmp /data/nm_restart/hadoop-2.4.1/data/yarn= /local/nmPrivate/application_1421723685222_0008/container_1421723685222_000= 8_01_000002/container_1421723685222_0008_01_000002.pid > /usr/bin/docker run --rm --name container_1421723685222_0008_01_000002 -= e GAIA_HOST_IP=3Dc162 -e GAIA_API_SERVER=3D10.6.207.226:8080 -e GAIA_CLUSTE= R_ID=3Dshpc-nm_restart -e GAIA_QUEUE=3Droot.tdwadmin -e GAIA_APP_NAME=3Dtes= t_nm_docker -e GAIA_INSTANCE_ID=3D1 -e GAIA_CONTAINER_ID=3Dcontainer_142172= 3685222_0008_01_000002 --memory=3D32M --cpu-shares=3D1024 -v /data/nm_resta= rt/hadoop-2.4.1/data/yarn/container-logs/application_1421723685222_0008/con= tainer_1421723685222_0008_01_000002:/data/nm_restart/hadoop-2.4.1/data/yarn= /container-logs/application_1421723685222_0008/container_1421723685222_0008= _01_000002 -v /data/nm_restart/hadoop-2.4.1/data/yarn/local/usercache/tdwad= min/appcache/application_1421723685222_0008/container_1421723685222_0008_01= _000002:/data/nm_restart/hadoop-2.4.1/data/yarn/local/usercache/tdwadmin/ap= pcache/application_1421723685222_0008/container_1421723685222_0008_01_00000= 2 -P -e A=3DB --privileged=3Dtrue docker.oa.com:8080/library/centos7 bash "= /data/nm_restart/hadoop-2.4.1/data/yarn/local/usercache/tdwadmin/appcache/a= pplication_1421723685222_0008/container_1421723685222_0008_01_000002/launch= _container.sh" > {quote} > The DockerContainerExecutor use docker inspect before docker run, so the = docker inspect couldn't get the right pid for the docker, signalContainer()= and nm restart would fail. -- This message was sent by Atlassian JIRA (v6.3.4#6332)