Date: Wed, 25 May 2016 22:04:12 +0000 (UTC)
From: "Jason Lowe (JIRA)"
To: yarn-dev@hadoop.apache.org
Subject: [jira] [Created] (YARN-5154) DelayedProcessKiller can kill the wrong process if pid is recycled

Jason Lowe created YARN-5154:
--------------------------------

             Summary: DelayedProcessKiller can kill the wrong process if pid is recycled
                 Key: YARN-5154
                 URL: https://issues.apache.org/jira/browse/YARN-5154
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 2.6.4, 2.7.2, 2.8.0
            Reporter: Jason Lowe

YARN-4459
reduced the chance of the container-executor accidentally killing the wrong process when pids are recycled, but this can still occur. We should improve the DelayedProcessKiller to cancel the timer if the container exits between the delivery of the two signals, and/or implement sanity checks in the executor just before it delivers the signal, to avoid killing the wrong process.
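
The two mitigations proposed above could look roughly like the following. This is a minimal sketch and not the NodeManager's actual DelayedProcessKiller: the class name CancellableDelayedKiller, the stillOurContainer check, and the forceKill callback are all hypothetical names introduced for illustration. It also assumes the sanity check can be expressed as a predicate on the pid on the Java side, whereas the issue suggests doing it inside the executor.

```java
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;
import java.util.function.IntConsumer;
import java.util.function.IntPredicate;

/**
 * Hypothetical sketch of a delayed second signal that can be cancelled.
 * None of these names come from the Hadoop code base.
 */
class CancellableDelayedKiller {
    private final ScheduledExecutorService timer;
    // Assumed sanity check: does this pid still belong to our container?
    private final IntPredicate stillOurContainer;
    // Assumed callback that delivers the second (forceful) signal.
    private final IntConsumer forceKill;

    CancellableDelayedKiller(ScheduledExecutorService timer,
                             IntPredicate stillOurContainer,
                             IntConsumer forceKill) {
        this.timer = timer;
        this.stillOurContainer = stillOurContainer;
        this.forceKill = forceKill;
    }

    /**
     * Schedules the second signal after delayMs. The caller keeps the
     * returned handle so the timer can be cancelled on container exit.
     */
    ScheduledFuture<?> scheduleForceKill(int pid, long delayMs) {
        return timer.schedule(() -> {
            // Re-check ownership just before signalling. This narrows the
            // race window but cannot close it: the pid could still be
            // recycled between the check and the signal.
            if (stillOurContainer.test(pid)) {
                forceKill.accept(pid);
            }
        }, delayMs, TimeUnit.MILLISECONDS);
    }

    /**
     * Call when the container's exit is observed after the first signal.
     * Returns true if the pending second signal was cancelled in time.
     */
    static boolean onContainerExit(ScheduledFuture<?> pending) {
        return pending.cancel(false);
    }
}
```

In this sketch the caller would send the first signal, call scheduleForceKill, and pass the returned handle to onContainerExit as soon as the container is seen to exit, so the second signal is never delivered to a recycled pid in the common case.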