Date: Fri, 6 May 2016 20:15:12 +0000 (UTC)
From: "Chris Trezzo (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Subject: [jira] [Created] (MAPREDUCE-6690) Limit the number of resources a single map reduce job can submit for localization

Chris Trezzo created MAPREDUCE-6690:
---------------------------------------

Summary: Limit the number of resources a single map reduce job can submit for localization
Key: MAPREDUCE-6690
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6690
Project: Hadoop Map/Reduce
Issue Type: New Feature
Reporter: Chris Trezzo
Assignee: Chris Trezzo

Users will sometimes submit a large number of resources to be localized as part of a single MapReduce job. This can cause issues with YARN localization that destabilize the cluster and potentially impact other users' jobs. These resources are specified via the files, libjars, archives and jobjar command line arguments or directly through the configuration (i.e. the distributed cache API). The resources specified could be too large in multiple dimensions:

# Total size
# Number of files
# Size of an individual resource (e.g. a large fat jar)

We would like to encourage good behavior on the client side by having the option of enforcing resource limits along the above dimensions.

There should be a separate effort to enforce limits at the YARN layer on the server side, but this JIRA covers only the MapReduce layer on the client side. In practice, having these client-side limits will get us a long way towards preventing these localization anti-patterns.
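To make the three dimensions named in this issue concrete (total size, number of files, size of an individual resource), here is a minimal sketch of what a client-side check at job submission could look like. It is written in Python for brevity and is not Hadoop code: the `Resource` and `ResourceLimits` types, the `check_limits` function, and the convention that a limit of 0 disables a check are all assumptions made for illustration, not names or behavior taken from this issue or from MapReduce.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Resource:
    """Hypothetical record for one file, libjar, archive, or job jar."""
    path: str
    size_bytes: int


@dataclass(frozen=True)
class ResourceLimits:
    """Hypothetical limits; a value of 0 (the default) disables that check."""
    max_resources: int = 0
    max_total_bytes: int = 0
    max_single_bytes: int = 0


def check_limits(resources: Iterable[Resource], limits: ResourceLimits) -> List[str]:
    """Return one message per violated limit; an empty list means no violation."""
    items = list(resources)
    violations: List[str] = []

    # Dimension: number of files.
    if limits.max_resources > 0 and len(items) > limits.max_resources:
        violations.append(
            f"{len(items)} resources exceed the limit of {limits.max_resources}"
        )

    # Dimension: total size of everything submitted for localization.
    total = sum(r.size_bytes for r in items)
    if limits.max_total_bytes > 0 and total > limits.max_total_bytes:
        violations.append(
            f"total size {total} bytes exceeds the limit of {limits.max_total_bytes}"
        )

    # Dimension: size of an individual resource (e.g. a large fat jar).
    if limits.max_single_bytes > 0:
        for r in items:
            if r.size_bytes > limits.max_single_bytes:
                violations.append(
                    f"{r.path} is {r.size_bytes} bytes, over the per-resource "
                    f"limit of {limits.max_single_bytes}"
                )

    return violations
```

A client could call `check_limits` just before submitting the job and refuse to submit when the returned list is non-empty. Reporting every violation at once, rather than stopping at the first, is a design choice of this sketch so that a user can fix all problems in one pass.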