conftrace_

← Architectures

Deep Learning › Architectures ›

Transformers

9,294 papers

Papers per year

Papers

Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent EMNLP 2021

Sentence Bottleneck Autoencoders from Transformer Language Models EMNLP 2021

AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions EMNLP 2021

A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation EMNLP 2021

Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder EMNLP 2021

What’s Hidden in a One-layer Randomly Weighted Transformer? EMNLP 2021

A Simple and Effective Positional Encoding for Transformers EMNLP 2021

Explore Better Relative Position Embeddings from Encoding Perspective for Transformer Models EMNLP 2021

Recurrent Attention for Neural Machine Translation EMNLP 2021

Enlivening Redundant Heads in Multi-head Self-attention for Machine Translation EMNLP 2021

SHAPE: Shifted Absolute Position Embedding for Transformers EMNLP 2021

STANKER: Stacking Network based on Level-grained Attention-masked BERT for Rumor Detection on Social Media EMNLP 2021

FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation EMNLP 2021

Context-Aware Interaction Network for Question Matching EMNLP 2021

Considering Nested Tree Structure in Sentence Extractive Summarization with Pre-trained Transformer EMNLP 2021

Structural Adapters in Pretrained Language Models for AMR-to-Text Generation EMNLP 2021

Incorporating Residual and Normalization Layers into Analysis of Masked Language Models EMNLP 2021

Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement EMNLP 2021

Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy EMNLP 2021

Sentence-Permuted Paragraph Generation EMNLP 2021

Document-level Entity-based Extraction as Template Generation EMNLP 2021

Transformer Feed-Forward Layers Are Key-Value Memories EMNLP 2021

The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders EMNLP 2021

Gradient-based Adversarial Attacks against Text Transformers EMNLP 2021

Do Transformer Modifications Transfer Across Implementations and Applications? EMNLP 2021