transformer architecture
1555 papers
Also known as
TTE
STTR
TGT
DIT
ENT
BERT
DETR
ROBERTA
TA
Papers
How Much Attention Do You Need? A Granular Analysis of Neural Machine Translation Architectures
ACL 2018
Attention is All you Need
NIPS 2017