hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yufei Gu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-6042) Dump scheduler and queue state information into FairScheduler DEBUG log
Date Fri, 03 Mar 2017 18:08:45 GMT

    [ https://issues.apache.org/jira/browse/YARN-6042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15894782#comment-15894782

Yufei Gu commented on YARN-6042:

[~Tao Jie], YARN-5437 is an umbrella which adds useful message of scheduler in WebUI. YARN-4329
is the FS part, but we can always add more useful information in WebUI. Not a bad idea to
keep improvement on that. Other than that, we need add more queue metrics on scheduler WebUI,
which I cannot remember a JIRA for that. This JIRA will potentially dump a very long message,
add a link to WebUI for the new log file will be a reasonable solution. 

> Dump scheduler and queue state information into FairScheduler DEBUG log
> -----------------------------------------------------------------------
>                 Key: YARN-6042
>                 URL: https://issues.apache.org/jira/browse/YARN-6042
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: fairscheduler
>            Reporter: Yufei Gu
>            Assignee: Yufei Gu
>         Attachments: YARN-6042.001.patch, YARN-6042.002.patch, YARN-6042.003.patch, YARN-6042.004.patch,
YARN-6042.005.patch, YARN-6042.006.patch, YARN-6042.007.patch, YARN-6042.008.patch
> To improve the debugging of scheduler issues it would be a big improvement to be able
to dump the scheduler state into a log on request. 
> The Dump the scheduler state at a point in time would allow debugging of a scheduler
that is not hung (deadlocked) but also not assigning containers. Currently we do not have
a proper overview of what state the scheduler and the queues are in and we have to make assumptions
or guess
> The scheduler and queue state needed would include (not exhaustive):
> - instantaneous and steady fair share (app / queue)
> - AM share and resources
> - weight
> - app demand
> - application run state (runnable/non runnable)
> - last time at fair/min share

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

View raw message