conftrace
Transformers
9,294 papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Multiple Instance Verification
JMLR 2025
Neural Operators Can Play Dynamic Stackelberg Games
JMLR 2025
Fine-grained Fallacy Detection with Human Label Variation
NAACL 2025
MoCE: Adaptive Mixture of Contextualization Experts for Byte-based Neural Machine Translation
NAACL 2025
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
NAACL 2025
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks
NAACL 2025
ReGLA: Refining Gated Linear Attention
NAACL 2025
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison
NAACL 2025
Representing Rule-based Chatbots with Transformers
NAACL 2025
Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models
NAACL 2025
Label Drop for Multi-Aspect Relation Modeling in Universal Information Extraction
NAACL 2025
Token-based Decision Criteria Are Suboptimal in In-context Learning
NAACL 2025
Getting More Juice Out of Your Data: Hard Pair Refinement Enhances Visual-Language Models Without Extra Data
NAACL 2025
Analyzing the Inner Workings of Transformers in Compositional Generalization
NAACL 2025
Emergence of Episodic Memory in Transformers: Characterizing Changes in Temporal Structure of Attention Scores During Training
NAACL 2025
Prototypical Extreme Multi-label Classification with a Dynamic Margin Loss
NAACL 2025
Fine-Grained Transfer Learning for Harmful Content Detection through Label-Specific Soft Prompt Tuning
NAACL 2025
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech
NAACL 2025
ProSE: Diffusion Priors for Speech Enhancement
NAACL 2025
Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models
NAACL 2025
Cross-Lingual Transfer Learning for Speech Translation
NAACL 2025
Scaling Graph-Based Dependency Parsing with Arc Vectorization and Attention-Based Refinement
NAACL 2025
RTSM: Knowledge Distillation with Diverse Signals for Efficient Real-Time Semantic Matching in E-Commerce
NAACL 2025
CodeGenWrangler: Data Wrangling task automation using Code-Generating Models
NAACL 2025
Detecting Sexism in Tweets: A Sentiment Analysis and Graph Neural Network Approach
NAACL 2025