conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Architectures
Deep Learning
›
Architectures
›
Transformers
9,294 papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
EMNLP 2021
Sentence Bottleneck Autoencoders from Transformer Language Models
EMNLP 2021
AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions
EMNLP 2021
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation
EMNLP 2021
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder
EMNLP 2021
What’s Hidden in a One-layer Randomly Weighted Transformer?
EMNLP 2021
A Simple and Effective Positional Encoding for Transformers
EMNLP 2021
Explore Better Relative Position Embeddings from Encoding Perspective for Transformer Models
EMNLP 2021
Recurrent Attention for Neural Machine Translation
EMNLP 2021
Enlivening Redundant Heads in Multi-head Self-attention for Machine Translation
EMNLP 2021
SHAPE: Shifted Absolute Position Embedding for Transformers
EMNLP 2021
STANKER: Stacking Network based on Level-grained Attention-masked BERT for Rumor Detection on Social Media
EMNLP 2021
FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation
EMNLP 2021
Context-Aware Interaction Network for Question Matching
EMNLP 2021
Considering Nested Tree Structure in Sentence Extractive Summarization with Pre-trained Transformer
EMNLP 2021
Structural Adapters in Pretrained Language Models for AMR-to-Text Generation
EMNLP 2021
Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
EMNLP 2021
Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement
EMNLP 2021
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
EMNLP 2021
Sentence-Permuted Paragraph Generation
EMNLP 2021
Document-level Entity-based Extraction as Template Generation
EMNLP 2021
Transformer Feed-Forward Layers Are Key-Value Memories
EMNLP 2021
The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders
EMNLP 2021
Gradient-based Adversarial Attacks against Text Transformers
EMNLP 2021
Do Transformer Modifications Transfer Across Implementations and Applications?
EMNLP 2021
<
1
…
285
286
287
…
372
>