hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Wangda Tan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-9120) Need to have a way to turn off GPU auto-discovery in GpuDiscoverer
Date Fri, 14 Dec 2018 18:04:00 GMT

    [ https://issues.apache.org/jira/browse/YARN-9120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16721650#comment-16721650

Wangda Tan commented on YARN-9120:

[~snemeth] / [~tangzhankun], I prefer to make GPU plugin can be disabled/enabled as a whole.
To me adding the new option makes harder to troubleshooting. And I'm not sure if there's any
solid requirement to allow enable/disable GPU when the node is running, there're some logics
of NM may prevent this as well. Just my $0.02.

> Need to have a way to turn off GPU auto-discovery in GpuDiscoverer
> ------------------------------------------------------------------
>                 Key: YARN-9120
>                 URL: https://issues.apache.org/jira/browse/YARN-9120
>             Project: Hadoop YARN
>          Issue Type: Improvement
>            Reporter: Szilard Nemeth
>            Assignee: Szilard Nemeth
>            Priority: Major
> GpuDiscoverer.getGpusUsableByYarn either parses the user-defined GPU devices or should
have the value 'auto' (from property: yarn.nodemanager.resource-plugins.gpu.allowed-gpu-devices)
> In some circumstances, users would want to exclude a node from scheduling, so they should
have an option to turn off auto-discovery.
> It's straightforward that this is possible by removing the GPU resource-plugin from YARN's
config along with GPU-related config in container-executor.cfg, but doing that with a dedicated
value for yarn.nodemanager.resource-plugins.gpu.allowed-gpu-devices is a more lightweight

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org

View raw message