Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Architectures
Deep Learning
›
Architectures
›
Transformers
9294 directly classified papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Glancing Transformer for Non-Autoregressive Neural Machine Translation
ACL 2021
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding
ACL 2021
Lightweight Cross-Lingual Sentence Representation Learning
ACL 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021
G-Transformer for Document-Level Machine Translation
ACL 2021
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
ACL 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
ACL 2021
Syntax-Enhanced Pre-trained Model
ACL 2021
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling
ACL 2021
Discriminative Reranking for Neural Machine Translation
ACL 2021
The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models
ACL 2021
A Semantics-aware Transformer Model of Relation Linking for Knowledge Base Question Answering
ACL 2021
Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation
ACL 2021
Exploring Listwise Evidence Reasoning with T5 for Fact Verification
ACL 2021
Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling
ACL 2021
Transformer-Based Direct Hidden Markov Model for Machine Translation
ACL 2021
Synchronous Syntactic Attention for Transformer Neural Machine Translation
ACL 2021
DAAI at CASE 2021 Task 1: Transformer-based Multilingual Socio-political and Crisis Event Detection
ACL 2021
Ensemble ALBERT and RoBERTa for Span Prediction in Question Answering
ACL 2021
BERT Goes Shopping: Comparing Distributional Models for Product Representations
ACL 2021
Multilingual Dependency Parsing for Low-Resource African Languages: Case Studies on Bambara, Wolof, and Yoruba
ACL 2021
Applying Occam’s Razor to Transformer-Based Dependency Parsing: What Works, What Doesn’t, and What is Really Necessary
ACL 2021
XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders
COLING 2020
PolyGen: An Autoregressive Generative Model of 3D Meshes
ICML 2020
Emergence of Separable Manifolds in Deep Language Representations
ICML 2020
<
1
…
314
315
316
…
372
>