singa-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "zhangzhaoqi (Jira)" <j...@apache.org>
Subject [jira] [Created] (SINGA-506) add autograd operators for NLP models
Date Fri, 21 Feb 2020 04:28:00 GMT
zhangzhaoqi created SINGA-506:
---------------------------------

             Summary: add autograd operators for NLP models
                 Key: SINGA-506
                 URL: https://issues.apache.org/jira/browse/SINGA-506
             Project: Singa
          Issue Type: New Feature
            Reporter: zhangzhaoqi


*We are going to support these three NLP models, called, Bidirectional Attention Flow, BERT-Squad
and GPT-2.*

*Totally, there are still 21 operators that we need to add as following,*
ArgMax
Cast
CategoryMapper
Ceil
Compress
ConstantOfShape
Dropout
Gather
Hardmax
Identity
NonZero
ReduceMax
ReduceMean
ReduceSum
Scan
Shape
Slice
Split
Squeeze
Tile
Transpose

 

*For details, these 21 operators belong to these three models separately:*

*Bidirectional Attention Flow:*
ArgMax
Cast
CategoryMapper
Ceil
Compress
ConstantOfShape
Dropout
Gather
Hardmax
ReduceMax
ReduceSum
Scan
Shape
Slice
Squeeze
Transpose


*BERT-Squad:*
Slice
Squeeze
Shape
Identity
Gather
ReduceMean
Cast
Tile
Transpose
Split

*GPT-2:*
ConstantOfShape
Slice
Shape
Gather
ReduceMean
NonZero
Cast
Transpose
Split

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message