attention mechanism
3975 papers
Also known as
AGM
GPA
AM
MHSA
QKV
DAM
ATTENTION
MHA
IMA
Co-occurring keywords
Papers
Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks
NIPS 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
CVPR 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
INTERSPEECH 2023
Block-State Transformers
NIPS 2023
CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning
CVPR 2023
Abed at KSAA-RD Shared Task: Enhancing Arabic Word Embedding with Modified BERT Multilingual
EMNLP 2023
Neural Functional Transformers
NIPS 2023