Return-Path: X-Original-To: apmail-hadoop-yarn-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7039E1112B for ; Wed, 24 Sep 2014 23:55:34 +0000 (UTC) Received: (qmail 44366 invoked by uid 500); 24 Sep 2014 23:55:34 -0000 Delivered-To: apmail-hadoop-yarn-dev-archive@hadoop.apache.org Received: (qmail 44286 invoked by uid 500); 24 Sep 2014 23:55:33 -0000 Mailing-List: contact yarn-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-dev@hadoop.apache.org Delivered-To: mailing list yarn-dev@hadoop.apache.org Received: (qmail 44019 invoked by uid 99); 24 Sep 2014 23:55:33 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Sep 2014 23:55:33 +0000 Date: Wed, 24 Sep 2014 23:55:33 +0000 (UTC) From: "Sangjin Lee (JIRA)" To: yarn-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (YARN-2600) if the container is killed during localization outstanding public cache localization tasks should be cancelled MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Sangjin Lee created YARN-2600: --------------------------------- Summary: if the container is killed during localization outstanding public cache localization tasks should be cancelled Key: YARN-2600 URL: https://issues.apache.org/jira/browse/YARN-2600 Project: Hadoop YARN Issue Type: Improvement Components: nodemanager Affects Versions: 2.4.0 Reporter: Sangjin Lee We came across a situation (partly related with HDFS-7005) where a large number of public cache localization tasks were queued in the public localizer thread pool but the container is killed during localization (as it went over the timeout). What's not helpful in this situation is that any work item that's queued will still be serviced by the resource localization service which is wasteful. And that may further delay localization efforts of other containers. It would be good if we can cancel the pending localization tasks when the container is killed. -- This message was sent by Atlassian JIRA (v6.3.4#6332)