Date: Mon, 7 Nov 2016 16:34:58 +0000 (UTC)
From: "Varun Saxena (JIRA)"
To: yarn-issues@hadoop.apache.org
Subject: [jira] [Commented] (YARN-574) PrivateLocalizer does not support parallel resource download via ContainerLocalizer

    [ https://issues.apache.org/jira/browse/YARN-574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15644642#comment-15644642 ]

Varun Saxena commented on YARN-574:
-----------------------------------

[~ajithshetty], thanks for the patch.
Parallel downloads should speed up the container localization phase.
Coming to the patch, should the default configuration value be 4, or should we keep it at 1, i.e. the current behavior? It could then be tuned, based on the size and visibility of the resources to be downloaded, by anyone who wants to use it.
Localizers are not treated as containers, which means the resources they use are not accounted for. With a default value of 4 they could together end up consuming quite a bit of the node's resources, which we should avoid.
Thoughts?

> PrivateLocalizer does not support parallel resource download via ContainerLocalizer
> -----------------------------------------------------------------------------------
>
>                 Key: YARN-574
>                 URL: https://issues.apache.org/jira/browse/YARN-574
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>    Affects Versions: 2.6.0, 2.8.0, 2.7.1
>            Reporter: Omkar Vinit Joshi
>            Assignee: Ajith S
>         Attachments: YARN-574.03.patch, YARN-574.1.patch, YARN-574.2.patch
>
>
> At present, private resources are downloaded in parallel only if multiple containers request the same resource. Otherwise the download is serial. The protocol between PrivateLocalizer and ContainerLocalizer supports multiple downloads; however, this is not used, and only one resource is sent for downloading at a time.
> I think we can increase / assure parallelism (even for a single container requesting resources) for private/application resources by making multiple downloads per ContainerLocalizer.
> Total Parallelism before
> = number of threads allotted for PublicLocalizer [public resource] + number of containers [private and application resource]
> Total Parallelism after
> = number of threads allotted for PublicLocalizer [public resource] + number of containers * max downloads per container [private and application resource]
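
To make the concern about the default value concrete, the two formulas quoted above can be evaluated for a sample node. This is only a rough sketch: the class and method names are hypothetical (they are not part of the patch or of YARN), and the inputs (4 PublicLocalizer threads, 10 containers) are illustrative assumptions rather than measured values.

```java
// Hypothetical helper, not part of YARN or the YARN-574 patch.
// It only evaluates the two "Total Parallelism" formulas from the issue description.
final class LocalizerParallelism {
    private LocalizerParallelism() {}

    /** Before: each container's localizer downloads one private/application resource at a time. */
    static int totalBefore(int publicLocalizerThreads, int containers) {
        return publicLocalizerThreads + containers;
    }

    /** After: each container's localizer may run up to maxDownloadsPerContainer downloads at once. */
    static int totalAfter(int publicLocalizerThreads, int containers, int maxDownloadsPerContainer) {
        if (maxDownloadsPerContainer < 1) {
            throw new IllegalArgumentException("maxDownloadsPerContainer must be >= 1");
        }
        return publicLocalizerThreads + containers * maxDownloadsPerContainer;
    }

    public static void main(String[] args) {
        int publicThreads = 4; // assumed value, for illustration only
        int containers = 10;   // assumed number of localizing containers on the node

        System.out.println("before: " + totalBefore(publicThreads, containers));
        // Compare keeping the current behavior (1) with the proposed default (4).
        for (int maxDownloads : new int[] {1, 4}) {
            System.out.println("after (max downloads per container = " + maxDownloads + "): "
                    + totalAfter(publicThreads, containers, maxDownloads));
        }
    }
}
```

With a per-container limit of 1 the "after" figure equals the "before" figure, which is why a default of 1 preserves the current behavior, while any larger default multiplies the private/application term by that factor.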