transformer architecture
1555 papers
Also known as
TTE
STTR
TGT
DIT
ENT
BERT
DETR
ROBERTA
TA
Papers
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
NAACL 2025
Binary Event-Driven Spiking Transformer
IJCAI 2025
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
WACV 2025
Deconstructing Attention: Investigating Design Principles for Effective Language Modeling
IJCNLP 2025
DenseSSM: State Space Models with Dense Hidden Connection for Efficient Large Language Models
NAACL 2025
ReGLA: Refining Gated Linear Attention
NAACL 2025