hadoop-yarn-issues mailing list archives

From "Ray Chiang (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-4911) Bad placement policy in FairScheduler causes the RM to crash
Date Tue, 05 Apr 2016 21:17:25 GMT

     [ https://issues.apache.org/jira/browse/YARN-4911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ray Chiang updated YARN-4911:
-----------------------------
    Attachment: YARN-4911.001.patch

- Rather than removing the exception, catch it at a higher level, which allows for a clearer
error message.  With this patch, the failure is reported as:

java.io.IOException: Failed to run job : Unable to match app application_1459890777944_0001
to a queue placement policy.  Check with an administrator to make sure that you are submitting
to a valid queue and/or check that the queue placement policies have the create property set
to true.
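
For readers unfamiliar with the approach, here is a minimal Java sketch of the general idea of
catching the exception at a higher level. It is not the contents of YARN-4911.001.patch and does
not use the real FairScheduler classes; PlacementSketch, Rule, and placeOrReject are hypothetical
names invented for this illustration, and the rule matching is simplified. Only the
IllegalStateException text and the wording of the final message are taken from this issue.

```java
import java.io.IOException;
import java.util.List;
import java.util.Set;

// Hypothetical, simplified model; not the actual Hadoop YARN code.
class PlacementSketch {

    /** Hypothetical stand-in for a placement rule: a target queue and its create flag. */
    static final class Rule {
        final String queue;
        final boolean create;

        Rule(String queue, boolean create) {
            this.queue = queue;
            this.create = create;
        }
    }

    /** Low-level step: returns the first usable queue, or throws if no rule applies. */
    static String assignAppToQueue(List<Rule> rules, Set<String> existingQueues) {
        for (Rule rule : rules) {
            // A rule applies if its queue exists or the rule is allowed to create it.
            if (rule.create || existingQueues.contains(rule.queue)) {
                return rule.queue;
            }
        }
        throw new IllegalStateException("Should have applied a rule before reaching here");
    }

    /**
     * Higher-level caller: catches the low-level exception so that only this
     * application is rejected, with a message that tells the user what to check.
     */
    static String placeOrReject(String appId, List<Rule> rules, Set<String> existingQueues)
            throws IOException {
        try {
            return assignAppToQueue(rules, existingQueues);
        } catch (IllegalStateException e) {
            throw new IOException("Unable to match app " + appId
                + " to a queue placement policy.  Check with an administrator to make sure"
                + " that you are submitting to a valid queue and/or check that the queue"
                + " placement policies have the create property set to true.", e);
        }
    }
}
```

In this sketch, a single rule for queue okay1 with create set to false and no existing queues
makes placeOrReject throw an IOException for that one application, instead of letting the
IllegalStateException escape to the caller.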


> Bad placement policy in FairScheduler causes the RM to crash
> ------------------------------------------------------------
>
>                 Key: YARN-4911
>                 URL: https://issues.apache.org/jira/browse/YARN-4911
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: fairscheduler
>            Reporter: Ray Chiang
>            Assignee: Ray Chiang
>              Labels: supportability
>         Attachments: YARN-4911.001.patch
>
>
> When you have a fair-scheduler.xml with the rule:
>   <queuePlacementPolicy>
>     <rule name="default" queue="okay1" create="false" />
>   </queuePlacementPolicy>
> and the queue okay1 doesn't exist, the following exception occurs in the RM:
> 2016-04-01 16:56:33,383 FATAL org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Error in handling event type APP_ADDED to the scheduler
> java.lang.IllegalStateException: Should have applied a rule before reaching here
>         at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.QueuePlacementPolicy.assignAppToQueue(QueuePlacementPolicy.java:173)
>         at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.assignToQueue(FairScheduler.java:728)
>         at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.addApplication(FairScheduler.java:634)
>         at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.handle(FairScheduler.java:1224)
>         at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.handle(FairScheduler.java:112)
>         at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:691)
>         at java.lang.Thread.run(Thread.java:745)
> which causes the RM to crash.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)