Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 6A521200B82 for ; Fri, 2 Sep 2016 06:07:23 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 68D20160AB5; Fri, 2 Sep 2016 04:07:23 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id AE985160AB7 for ; Fri, 2 Sep 2016 06:07:22 +0200 (CEST) Received: (qmail 22903 invoked by uid 500); 2 Sep 2016 04:07:21 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 22862 invoked by uid 99); 2 Sep 2016 04:07:21 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Sep 2016 04:07:21 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 77D7C2C1B7B for ; Fri, 2 Sep 2016 04:07:21 +0000 (UTC) Date: Fri, 2 Sep 2016 04:07:21 +0000 (UTC) From: "Jian He (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (YARN-5576) Core change to localize resource while container is running MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 02 Sep 2016 04:07:23 -0000 [ https://issues.apache.org/jira/browse/YARN-5576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15457446#comment-15457446 ] Jian He edited comment on YARN-5576 at 9/2/16 4:06 AM: ------------------------------------------------------- bq. We need to override the ContainerManagerImpl::localize() method in the QueuingContainerManagerImpl. Re-localization should not be allowed if the container is currently queued (not yet running) It is not allowed, the method in ContainerManagerImpl only allows localization while running bq. I only see entries added to ResourceSet::resourcesFailedToBeLocalized set. Shouldnt we remove these once the AM is notified of the failure ? Also, Shouldn't these be notified back to the AM ? or we are just relying on the diagnostic string sent to the AM in the GetContainerStatus response to notify the AM ? The status part is not yet implemented as mentioned in the parent jira. It'll will be done once the requirement is clear. Earlier I was thinking these will be sent as part of container status. bq. wondering if we should have another RE_LOCALIZE_CONTAINER_RESOURCE event in the ResourceLocalizationService to distinguish from the localization needed for container initialization and correspondingly send different events to the Container. Or maybe for the timebeing, we should just rename INIT_CONTAINER_RESOURCE to LOCALIZE_CONTAINER_RESOURCE. I don't think adding new events type for doing the same thing is necessary at this point. This will also add additional complexity as you need to conditionally sends different types of events. The goal is to reuse existing code. I can rename it. bq. : spurious change in the imports of ContainerImpl and BaseAMRMProxyTest That's done by IDE auto fixing some unused imports, I edited it manually. was (Author: jianhe): bq. We need to override the ContainerManagerImpl::localize() method in the QueuingContainerManagerImpl. Re-localization should not be allowed if the container is currently queued (not yet running) It is not allowed, the method in ContainerManagerImpl only allows localization while running bq. We need to override the ContainerManagerImpl::localize() method in the QueuingContainerManagerImpl. Re-localization should not be allowed if the container is currently queued (not yet running) bq. I only see entries added to ResourceSet::resourcesFailedToBeLocalized set. Shouldnt we remove these once the AM is notified of the failure ? Also, Shouldn't these be notified back to the AM ? or we are just relying on the diagnostic string sent to the AM in the GetContainerStatus response to notify the AM ? The status part is not yet implemented as mentioned in the parent jira. It'll will be done once the requirement is clear. Earlier I was thinking these will be sent as part of container status. bq. wondering if we should have another RE_LOCALIZE_CONTAINER_RESOURCE event in the ResourceLocalizationService to distinguish from the localization needed for container initialization and correspondingly send different events to the Container. Or maybe for the timebeing, we should just rename INIT_CONTAINER_RESOURCE to LOCALIZE_CONTAINER_RESOURCE. I don't think adding new events type for doing the same thing is necessary at this point. This will also add additional complexity as you need to conditionally sends different types of events. The goal is to reuse existing code. I can rename it. bq. : spurious change in the imports of ContainerImpl and BaseAMRMProxyTest That's done by IDE auto fixing some unused imports, I edited it manually. > Core change to localize resource while container is running > ----------------------------------------------------------- > > Key: YARN-5576 > URL: https://issues.apache.org/jira/browse/YARN-5576 > Project: Hadoop YARN > Issue Type: Sub-task > Reporter: Jian He > Assignee: Jian He > Attachments: YARN-5576.1.patch, YARN-5576.2.patch, YARN-5576.3.patch > > -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org