transformer architecture
1555 papers
Also known as
TTE
STTR
TGT
DIT
ENT
BERT
DETR
ROBERTA
TA
Co-occurring keywords
Papers
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models
INTERSPEECH 2021
ViViT: A Video Vision Transformer
ICCV 2021
Noise Robust Acoustic Modeling for Single-Channel Speech Recognition Based on a Stream-Wise Transformer Architecture
INTERSPEECH 2021
I-BERT: Integer-only BERT Quantization
ICML 2021
Transformer-Style Relational Reasoning with Dynamic Memory Updating for Temporal Network Modeling
AAAI 2021
AMR Parsing with Action-Pointer Transformer
NAACL 2021