Transformers

9,294 papers

Papers

Two Issues with Chinese Spelling Correction and A Refinement Solution ACL 2024

Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval ACL 2024

Greed is All You Need: An Evaluation of Tokenizer Inference Methods ACL 2024

LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models ACL 2024

LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models ACL 2024

MoExtend: Tuning New Experts for Modality and Task Extension ACL 2024

Computational Expressivity of Neural Language Models ACL 2024

Resonance RoPE: Improving Context Length Generalization of Large Language Models ACL 2024

Controllable Text Generation with Residual Memory Transformer ACL 2024

Neurons in Large Language Models: Dead, N-gram, Positional ACL 2024

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models ACL 2024

Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector ACL 2024

VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks ACL 2024

Selective Prefix Tuning for Pre-trained Language Models ACL 2024

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives ACL 2024

Generative Input: Towards Next-Generation Input Methods Paradigm ACL 2024

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task ACL 2024

On the Language Encoder of Contrastive Cross-modal Models ACL 2024

Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision ACL 2024

Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument Extraction ACL 2024

Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge ACL 2024

ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer Model ACL 2024

ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and Expressivity ACL 2024

PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning ACL 2024

Identifying Semantic Induction Heads to Understand In-Context Learning ACL 2024