Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B413E200B50 for ; Fri, 29 Jul 2016 14:25:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B2E92160A79; Fri, 29 Jul 2016 12:25:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E6866160AA7 for ; Fri, 29 Jul 2016 14:25:21 +0200 (CEST) Received: (qmail 44103 invoked by uid 500); 29 Jul 2016 12:25:21 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 43748 invoked by uid 99); 29 Jul 2016 12:25:21 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 29 Jul 2016 12:25:21 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 98C9B2C0D69 for ; Fri, 29 Jul 2016 12:25:20 +0000 (UTC) Date: Fri, 29 Jul 2016 12:25:20 +0000 (UTC) From: "Rohith Sharma K S (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 29 Jul 2016 12:25:22 -0000 [ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15399237#comment-15399237 ] Rohith Sharma K S commented on YARN-5333: ----------------------------------------- I just tried modifying the code, the below error I was talking that RMWebApp start fails. {noformat} com.google.inject.CreationException: Unable to create injector, see the following errors: 1) Binding to null instances is not allowed. Use toProvider(Providers.of(null)) if this is your intended behaviour. at org.apache.hadoop.yarn.webapp.WebApps$Builder$2.configure(WebApps.java:335) 1 error at com.google.inject.internal.Errors.throwCreationExceptionIfErrorsExist(Errors.java:466) at com.google.inject.internal.InternalInjectorCreator.initializeStatically(InternalInjectorCreator.java:155) at com.google.inject.internal.InternalInjectorCreator.build(InternalInjectorCreator.java:107) at com.google.inject.Guice.createInjector(Guice.java:96) at com.google.inject.Guice.createInjector(Guice.java:73) at com.google.inject.Guice.createInjector(Guice.java:62) at org.apache.hadoop.yarn.webapp.WebApps$Builder.build(WebApps.java:331) at org.apache.hadoop.yarn.webapp.WebApps$Builder.start(WebApps.java:372) at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startWepApp(ResourceManager.java:1025) at org.apache.hadoop.yarn.server.resourcemanager.MockRM.startWepApp(MockRM.java:909) at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:1127) at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193) at org.apache.hadoop.yarn.server.resourcemanager.TestRMHA.testRMDispatcherForHA(TestRMHA.java:333) {noformat} Apart from above, other point is RMWebService is started in StandBy RM where in REST calls can be made. Since if we do not initialize active services, then we could expect NPE from RMWebService. There are many more things to take care if we go for initializing active services during transitionToActive. > Some recovered apps are put into default queue when RM HA > --------------------------------------------------------- > > Key: YARN-5333 > URL: https://issues.apache.org/jira/browse/YARN-5333 > Project: Hadoop YARN > Issue Type: Bug > Reporter: Jun Gong > Assignee: Jun Gong > Attachments: YARN-5333.01.patch, YARN-5333.02.patch, YARN-5333.03.patch > > > Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to false, {{yarn.scheduler.fair.user-as-default-queue}} is set to false. > Reproduce steps: > 1. Start two RMs. > 2. After RMs are running, change both RM's file {{etc/hadoop/fair-scheduler.xml}}, then add some queues. > 3. Submit some apps to the new added queues. > 4. Stop the active RM, then the standby RM will transit to active and recover apps. > However the new active RM will put recovered apps into default queue because it might have not loaded the new {{fair-scheduler.xml}}. We need call {{initScheduler}} before start active services or bring {{refreshAll()}} in front of {{rm.transitionToActive()}}. *It seems it is also important for other scheduler*. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org