hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jun Gong (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA
Date Wed, 03 Aug 2016 07:31:20 GMT

    [ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405503#comment-15405503
] 

Jun Gong commented on YARN-5333:
--------------------------------

Hi [~jianhe], I think the [comment|https://issues.apache.org/jira/browse/YARN-3893?focusedCommentId=14708471&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14708471]
in YARN-3893 makes sense. How about it?
{quote}
If refreshAll() fails, BOTH RM will be in ACTIVE state as per this defect. Continuing RM services
with invalid configuration does not good idea. Moreover invalid configurations should be notified
to user immediately. So it would be better to make use of fail-fast configuration to exit
the RM JVM. If this configuration is set to false , then call rm.handleTransitionToStandBy.
{quote}

> Some recovered apps are put into default queue when RM HA
> ---------------------------------------------------------
>
>                 Key: YARN-5333
>                 URL: https://issues.apache.org/jira/browse/YARN-5333
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Jun Gong
>            Assignee: Jun Gong
>         Attachments: YARN-5333.01.patch, YARN-5333.02.patch, YARN-5333.03.patch, YARN-5333.04.patch,
YARN-5333.05.patch, YARN-5333.06.patch
>
>
> Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is
set to false, {{yarn.scheduler.fair.user-as-default-queue}} is set to false.
> Reproduce steps:
> 1. Start two RMs.
> 2. After RMs are running, change both RM's file {{etc/hadoop/fair-scheduler.xml}}, then
add some queues.
> 3. Submit some apps to the new added queues.
> 4. Stop the active RM, then the standby RM will transit to active and recover apps.
> However the new active RM will put recovered apps into default queue because it might
have not loaded the new {{fair-scheduler.xml}}. We need call {{initScheduler}} before start
active services or bring {{refreshAll()}} in front of {{rm.transitionToActive()}}. *It seems
it is also important for other scheduler*.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org


Mime
View raw message