mxnet-dev mailing list archives

From barry-jin <notificati...@github.com>
Subject [apache/incubator-mxnet] [RFC] Turn Off CuDNN When Training PSPNet (#19056)
Date Tue, 01 Sep 2020 00:03:52 GMT
## Error when training PSPNet on Cityscapes dataset using GluonCV #17439

### Problem Description
When I train a PSPNet on the Cityscapes dataset using the GluonCV semantic segmentation library,
training hangs right after it starts.

### Debugging
Bisecting by date of failure, I found that the first bad commit is [PR 13896](https://github.com/apache/incubator-mxnet/pull/13896),
which introduced this problem.
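
The bisection step can be sketched as a binary search over builds ordered by date. This is only an illustration: the build labels and the `is_bad` predicate are hypothetical, and in practice the predicate would install a given build and run the training script with a timeout to detect the hang.

```python
from typing import Callable, Sequence


def first_bad(builds: Sequence[str], is_bad: Callable[[str], bool]) -> str:
    """Return the earliest build for which is_bad() is true.

    Assumes builds are ordered oldest to newest, the first one is good,
    the last one is bad, and every build after the first bad one is bad.
    """
    lo, hi = 0, len(builds) - 1  # invariant: builds[lo] is good, builds[hi] is bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_bad(builds[mid]):
            hi = mid  # failure already present: look earlier
        else:
            lo = mid  # still good: look later
    return builds[hi]


# Hypothetical build labels, oldest first.
builds = ["build-01", "build-02", "build-03", "build-04", "build-05", "build-06"]

# Stand-in predicate: pretend every build from build-04 onward hangs.
culprit = first_bad(builds, lambda b: b >= "build-04")
print(culprit)
```

Each step halves the remaining range, so about log2(N) training runs are enough to isolate the first failing build among N candidates.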

## Proposed solutions
Turn off cuDNN for dropout by setting `cudnn_off` to `True` in the Gluon [Dropout](https://github.com/apache/incubator-mxnet/blob/9b22c8c2e935cd42ff0f7d339a4b790f5b3367b6/python/mxnet/gluon/nn/basic_layers.py#L271) layer.

## References
- [Issue #17439](https://github.com/apache/incubator-mxnet/issues/17439)
- [PR #13896](https://github.com/apache/incubator-mxnet/pull/13896)


-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/apache/incubator-mxnet/issues/19056