hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sunil G (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-6872) Ensure apps could run given NodeLabels are disabled post RM switchover/restart
Date Mon, 31 Jul 2017 18:14:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16107715#comment-16107715

Sunil G commented on YARN-6872:

NM work preserving was off. Now I can see that resources are coming correctly.

However I am seeing an issue with Cluster Metrics. Its coming -ve or wrong after RM restart.
Even without node label disabled scenario, metrics are wrong. I think it should be handled
in another ticket as metrics calculation is wrong after running app recovery and RM work preserving
restart (when labels are used).

Please suggest whether we need to include metrics issue also here.
cc/[~leftnoteasy] and [~jianhe]

> Ensure apps could run given NodeLabels are disabled post RM switchover/restart
> ------------------------------------------------------------------------------
>                 Key: YARN-6872
>                 URL: https://issues.apache.org/jira/browse/YARN-6872
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>            Reporter: Sunil G
>            Assignee: Sunil G
>         Attachments: YARN-6872.001.patch
> Post YARN-6031, few apps could be failed during recovery provided they had some label
requirements for AM and labels were disable post RM restart/switchover. As discussed in YARN-6031,
its better to run such apps as it may be long running apps as well.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

View raw message