mxnet-commits mailing list archives

From GitBox <...@apache.org>
Subject [GitHub] szha commented on issue #12541: [MXNET-936] [WIP] [DO NOT REVIEW] Support projection and clip in CuDNN LSTM
Date Sun, 23 Sep 2018 02:59:06 GMT
szha commented on issue #12541: [MXNET-936] [WIP] [DO NOT REVIEW] Support projection and clip in CuDNN LSTM
URL: https://github.com/apache/incubator-mxnet/pull/12541#issuecomment-423788130
 
 
   When creating begin_states, the current Gluon RNN layer logic doesn't handle the dtype
correctly (reported by @Ishitori). Example:
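   ```
   from mxnet import nd
   from mxnet.gluon.rnn import LSTM

   # Sizes are not given in the original report; these values are placeholders.
   batch_size, context_max_length, embedding_size = 4, 10, 16
   fake_data = nd.random.uniform(shape=(batch_size, context_max_length, 8 * embedding_size),
                                 dtype="float16")
   attention_output = nd.transpose(fake_data, axes=(1, 0, 2))  # NTC -> TNC layout
   modeling_layer = LSTM(hidden_size=100, num_layers=2, dropout=0.2, bidirectional=True)
   modeling_layer.cast("float16")
   modeling_layer.initialize()
   # No states are passed, so the layer creates begin_states itself (the step reported above).
   modeling_layer(attention_output)
   ```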
   
   @haojin2 could you include the fix?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services
