tvm-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [incubator-tvm] JonathanMace opened a new issue #5420: Relay CUDA strategy requires local GPU even for remote training
Date Thu, 23 Apr 2020 10:42:56 GMT

JonathanMace opened a new issue #5420:
URL: https://github.com/apache/incubator-tvm/issues/5420


   I'm training a resnet18_v2 model for CUDA using the RPC runner.  I run an RPC server on
a remote machine that has a GPU, and I run the training program locally on a machine that
has no GPUs.
   
   `extract_from_program` fails because internally a GPU is required.  Specifically, in `/python/tvm/relay/op/strategy/cuda.py`,
the code checks for local GPU properties in three places:
   ```
   if nvcc.have_tensorcore(tvm.gpu(0).compute_version):
   ```
   This check fails when there is no local GPU.  It's also logically inconsistent for remote
GPUs even if the local GPU exists.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



Mime
View raw message