conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Architectures
Deep Learning
›
Architectures
›
Transformers
9,294 papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Two Issues with Chinese Spelling Correction and A Refinement Solution
ACL 2024
Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval
ACL 2024
Greed is All You Need: An Evaluation of Tokenizer Inference Methods
ACL 2024
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models
ACL 2024
LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models
ACL 2024
MoExtend: Tuning New Experts for Modality and Task Extension
ACL 2024
Computational Expressivity of Neural Language Models
ACL 2024
Resonance RoPE: Improving Context Length Generalization of Large Language Models
ACL 2024
Controllable Text Generation with Residual Memory Transformer
ACL 2024
Neurons in Large Language Models: Dead, N-gram, Positional
ACL 2024
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
ACL 2024
Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector
ACL 2024
VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks
ACL 2024
Selective Prefix Tuning for Pre-trained Language Models
ACL 2024
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
ACL 2024
Generative Input: Towards Next-Generation Input Methods Paradigm
ACL 2024
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task
ACL 2024
On the Language Encoder of Contrastive Cross-modal Models
ACL 2024
Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision
ACL 2024
Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument Extraction
ACL 2024
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
ACL 2024
ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer Model
ACL 2024
ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and Expressivity
ACL 2024
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning
ACL 2024
Identifying Semantic Induction Heads to Understand In-Context Learning
ACL 2024
<
1
…
105
106
107
…
372
>