Luke Zettlemoyer
235 papers · 2007–2025 · 15 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer · Hot Topic Early Bird · Taxonomy Completionist (28) · Interdisciplinary Bridge · Conference Polyglot (15)
Interdisciplinary Bridge
Hot Topic Early Bird
Keyword Pioneer
Keyword Trendsetter Combo (14)
Conference Loyalist (70)
Topic Evolution
Dynamic Duo (38)
Grand Slam
Mega-Team (60)
Triple Crown
Topic Pioneer
Deep Specialist (20)
Keyword Champion
Keyword Collector (70)
Trend Setter
Century Club (235)
Unstoppable (19)
The Questioner (4)
Conference Pioneer
Prolific Year (34)
Conferences
EMNLP (70)
ACL (63)
ICLR (27)
IJCNLP (18)
NAACL (17)
NIPS (17)
ICML (7)
CVPR (4)
CORL (3)
EACL (3)
CONLL (2)
AAAI (1)
COLING (1)
ICCV (1)
IJCAI (1)
Keywords
language model (21)
large language model (21)
zero-shot learning (18)
question answering (15)
representation learning (13)
few-shot learning (12)
semantic parsing (12)
in-context learning (11)
transfer learning (11)
neural network (10)
multilingual language model (9)
cross-lingual transfer (9)
code generation (8)
language modeling (8)
coreference resolution (8)
text generation (7)
distant supervision (7)
machine translation (7)
masked language model (7)
semantic role labeling (7)
Papers
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
CORL 2025
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
NAACL 2025
Memory Layers at Scale
ICML 2025
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
ICLR 2025
(Mis)Fitting Scaling Laws: A Survey of Scaling Law Fitting Techniques in Deep Learning
ICLR 2025
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
ICLR 2025
Fantastic Copyrighted Beasts and How (Not) to Generate Them
ICLR 2025
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
ICLR 2025
Byte Latent Transformer: Patches Scale Better Than Tokens
ACL 2025
Improving Factuality with Explicit Working Memory
ACL 2025
Latent Action Pretraining from Videos
ICLR 2025
MULTIGUARD: An Efficient Approach for AI Safety Moderation Across Languages and Modalities
EMNLP 2025
s1: Simple test-time scaling
EMNLP 2025
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
NIPS 2024
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
ACL 2024
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
EMNLP 2024
Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models
EACL 2024
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
EMNLP 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
ACL 2024
OLMo: Accelerating the Science of Language Models
ACL 2024
In-Context Pretraining: Language Modeling Beyond Document Boundaries
ICLR 2024
MoDE: CLIP Data Experts via Clustering
CVPR 2024
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
NIPS 2024
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
NIPS 2024
Evaluating Copyright Takedown Methods for Language Models
NIPS 2024
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
Better Alignment with Instruction Back-and-Forth Translation
EMNLP 2024
Detecting Pretraining Data from Large Language Models
ICLR 2024
Altogether: Image Captioning via Re-aligning Alt-text
EMNLP 2024
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
ACL 2024
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding
NAACL 2024
REPLUG: Retrieval-Augmented Black-Box Language Models
NAACL 2024
Demystifying CLIP Data
ICLR 2024
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
ICLR 2024
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
ICLR 2024
Representation Deficiency in Masked Language Modeling
ICLR 2024
Self-Alignment with Instruction Backtranslation
ICLR 2024
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
ACL 2023
CREPE: Open-Domain Question Answering with False Presuppositions
ACL 2023
Contrastive Decoding: Open-ended Text Generation as Optimization
ACL 2023
Training Trajectories of Language Models Across Scales
ACL 2023
One Embedder, Any Task: Instruction-Finetuned Text Embeddings
ACL 2023
Nonparametric Masked Language Modeling
ACL 2023
In-context Examples Selection for Machine Translation
ACL 2023
Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
AAAI 2023
CiT: Curation in Training for Effective Vision-Language Data
ICCV 2023
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages
EMNLP 2023
Toward Human Readable Prompt Tuning: Kubrick’s The Shining is a good movie, and a good prompt too?
EMNLP 2023
Demystifying Prompts in Language Models via Perplexity Estimation
EMNLP 2023
Getting MoRE out of Mixture of Language Model Reasoning Experts
EMNLP 2023
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
EMNLP 2023
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
EMNLP 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
EMNLP 2023
Revisiting Machine Translation for Cross-lingual Classification
EMNLP 2023
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
NIPS 2023
AGRO: Adversarial discovery of error-prone Groups for Robust Optimization
ICLR 2023
Mega: Moving Average Equipped Gated Attention
ICLR 2023
Selective Annotation Makes Language Models Better Few-Shot Learners
ICLR 2023
QLoRA: Efficient Finetuning of Quantized LLMs
NIPS 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
NIPS 2023
LIMA: Less Is More for Alignment
NIPS 2023
Stable and low-precision training for large-scale vision-language models
NIPS 2023
Scaling Laws for Generative Mixed-Modal Language Models
ICML 2023
The case for 4-bit precision: k-bit Inference Scaling Laws
ICML 2023
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation
ICML 2023
Retrieval-Augmented Multimodal Language Modeling
ICML 2023
Binding Language Models in Symbolic Languages
ICLR 2023
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
ICLR 2023
InCoder: A Generative Model for Code Infilling and Synthesis
ICLR 2023
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations
ACL 2023
Prompting Language Models for Linguistic Structure
ACL 2023
Few-shot Learning with Multilingual Generative Language Models
EMNLP 2022
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
ICLR 2022
8-bit Optimizers via Block-wise Quantization
ICLR 2022
BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation
NAACL 2022
DEMix Layers: Disentangling Domains for Modular Language Modeling
NAACL 2022
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
NAACL 2022
MetaICL: Learning to Learn In Context
NAACL 2022
Improving Policy Learning via Language Dynamics Distillation
NIPS 2022
GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale
NIPS 2022
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
NIPS 2022
Prompt-free and Efficient Few-shot Learning with Language Models
ACL 2022
FaVIQ: FAct Verification from Information-seeking Questions
ACL 2022
Noisy Channel Language Model Prompting for Few-Shot Text Classification
ACL 2022
Question Answering Infused Pre-training of General-Purpose Contextualized Representations
ACL 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
EMNLP 2022
M2D2: A Massively Multi-Domain Language Modeling Dataset
EMNLP 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
EMNLP 2022
Nearest Neighbor Zero-Shot Inference
EMNLP 2022
Natural Language to Code Translation with Execution
EMNLP 2022
Language Contamination Helps Explains the Cross-lingual Capabilities of English Pretrained Models
EMNLP 2022
Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
EMNLP 2022
Improving Passage Retrieval with Zero-Shot Question Generation
EMNLP 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
EMNLP 2022
Efficient Large Scale Language Modeling with Mixtures of Experts
EMNLP 2022
CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation
EMNLP 2022
On the Role of Bidirectionality in Language Model Pre-Training
EMNLP 2022
Better Fine-Tuning by Reducing Representational Collapse
ICLR 2021
Language Grounding with 3D Objects
CORL 2021
Inducing Semantic Roles Without Syntax
IJCNLP 2021
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
IJCNLP 2021
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
IJCNLP 2021
Detecting Hallucinated Content in Conditional Neural Sequence Generation
ACL 2021
DESCGEN: A Distantly Supervised Dataset for Generating Entity Descriptions
ACL 2021
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
ACL 2021
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
ACL 2021
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment
ACL 2021
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
ACL 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
EMNLP 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
EMNLP 2021
Surface Form Competition: Why the Highest Probability Answer Isn’t Always Right
EMNLP 2021
Detecting Hallucinated Content in Conditional Neural Sequence Generation
IJCNLP 2021
Inducing Semantic Roles Without Syntax
ACL 2021
FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary
EACL 2021
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
IJCNLP 2021
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment
IJCNLP 2021
DESCGEN: A Distantly Supervised Dataset for Generating Entity Descriptions
IJCNLP 2021
SILG: The Multi-domain Symbolic Interactive Language Grounding Benchmark
NIPS 2021
Luna: Linear Unified Nested Attention
NIPS 2021
BASE Layers: Simplifying Training of Large, Sparse Models
ICML 2021
Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing
ICLR 2021
DeLighT: Deep and Light-weight Transformer
ICLR 2021
Nearest Neighbor Machine Translation
ICLR 2021
Scalable Zero-shot Entity Linking with Dense Entity Retrieval
EMNLP 2020
Aligned Cross Entropy for Non-Autoregressive Machine Translation
ICML 2020
Unsupervised Cross-lingual Representation Learning at Scale
ACL 2020
Active Learning for Coreference Resolution using Discrete Annotation
ACL 2020
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
ACL 2020
Controlled Crowdsourcing for High-Quality QA-SRL Annotation
ACL 2020
Emerging Cross-lingual Structure in Pretrained Language Models
ACL 2020
Simple and Effective Retrieve-Edit-Rerank Text Generation
ACL 2020
Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders
ACL 2020
Pre-training via Paraphrasing
NIPS 2020
An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction
EMNLP 2020
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
EMNLP 2020
AmbigQA: Answering Ambiguous Open-domain Questions
EMNLP 2020
Grounded Adaptation for Zero-shot Executable Semantic Parsing
EMNLP 2020
Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles
EMNLP 2020
Generalization through Memorization: Nearest Neighbor Language Models
ICLR 2020
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
CVPR 2020
QANom: Question-Answer driven SRL for Nominalizations
COLING 2020
Iterative Search for Weakly Supervised Semantic Parsing
NAACL 2019
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
NAACL 2019
Learning Programmatic Idioms for Scalable Semantic Parsing
EMNLP 2019
Vision-and-Dialog Navigation
CORL 2019
A Discrete Hard EM Approach for Weakly Supervised Question Answering
IJCNLP 2019
Cloze-driven Pretraining of Self-attention Networks
EMNLP 2019
Don’t Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases
EMNLP 2019
A Discrete Hard EM Approach for Weakly Supervised Question Answering
EMNLP 2019
Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog
EMNLP 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
IJCNLP 2019
Better Character Language Modeling through Morphology
ACL 2019
Evaluating Gender Bias in Machine Translation
ACL 2019
E3: Entailment-driven Extracting and Editing for Conversational Machine Reading
ACL 2019
BERT for Coreference Resolution: Baselines and Analysis
IJCNLP 2019
JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation
IJCNLP 2019
Learning Programmatic Idioms for Scalable Semantic Parsing
IJCNLP 2019
Cloze-driven Pretraining of Self-attention Networks
IJCNLP 2019
Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog
IJCNLP 2019
Multi-hop Reading Comprehension through Question Decomposition and Rescoring
ACL 2019
Don’t Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases
IJCNLP 2019
Compositional Questions Do Not Necessitate Multi-hop Reasoning
ACL 2019
The Referential Reader: A Recurrent Entity Network for Anaphora Resolution
ACL 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
EMNLP 2019
BERT for Coreference Resolution: Baselines and Analysis
EMNLP 2019
JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation
EMNLP 2019
Mapping Language to Code in Programmatic Context
EMNLP 2018
Ultra-Fine Entity Typing
ACL 2018
Large-Scale QA-SRL Parsing
ACL 2018
Deep RNNs Encode Soft Hierarchical Syntax
ACL 2018
Jointly Predicting Predicates and Arguments in Neural Semantic Role Labeling
ACL 2018
Long Short-Term Memory as a Dynamically Computed Element-wise Weighted Sum
ACL 2018
Neural Semantic Parsing
ACL 2018
AllenNLP: A Deep Semantic Natural Language Processing Platform
ACL 2018
SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach
EMNLP 2018
Neural Metaphor Detection in Context
EMNLP 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
EMNLP 2018
QuAC: Question Answering in Context
EMNLP 2018
Syntactic Scaffolds for Semantic Structures
EMNLP 2018
Deep contextualized word representations
ICLR 2018
Supervised Open Information Extraction
NAACL 2018
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
NAACL 2018
Deep Contextualized Word Representations
NAACL 2018
Crowdsourcing Question-Answer Meaning Representations
NAACL 2018
Higher-Order Coreference Resolution with Coarse-to-Fine Inference
NAACL 2018
Zero-Shot Relation Extraction via Reading Comprehension
CONLL 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
ACL 2017
End-to-end Neural Coreference Resolution
EMNLP 2017
Neural AMR: Sequence-to-Sequence Models for Parsing and Generation
ACL 2017
Deep Semantic Role Labeling: What Works and What’s Next
ACL 2017
Learning a Neural Semantic Parser from User Feedback
ACL 2017
Commonly Uncommon: Semantic Sparsity in Situation Recognition
CVPR 2017
Globally Coherent Text Generation with Neural Checklist Models
EMNLP 2016
A Theme-Rewriting Approach for Generating Algebra Word Problems
EMNLP 2016
Human-in-the-Loop Parsing
EMNLP 2016
Global Neural CCG Parsing with Optimality Guarantees
EMNLP 2016
Situation Recognition: Visual Semantic Role Labeling for Image Understanding
CVPR 2016
LSTM CCG Parsing
NAACL 2016
Document-level Sentiment Inference with Social, Faction, and Discourse Context
ACL 2016
Summarizing Source Code using a Neural Attention Model
ACL 2016
Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language
EMNLP 2015
Broad-coverage CCG Semantic Parsing with AMR
EMNLP 2015
Scalable Semantic Parsing with Partial Ontologies
IJCNLP 2015
Event Detection and Factuality Assessment with Non-Expert Supervision
EMNLP 2015
Joint A* CCG Parsing and Semantic Role Labelling
EMNLP 2015
Mise en Place: Unsupervised Interpretation of Instructional Recipes
EMNLP 2015
Scalable Semantic Parsing with Partial Ontologies
ACL 2015
Personalized Mathematical Word Problem Generation
IJCAI 2015
Semantic Parsing with Combinatory Categorial Grammars
EMNLP 2014
Morpho-syntactic Lexical Generalization for CCG Semantic Parsing
EMNLP 2014
Context-dependent Semantic Parsing for Time Expressions
ACL 2014
Learning to Automatically Solve Algebra Word Problems
ACL 2014
Lightly Supervised Learning of Procedural Dialog Systems
ACL 2013
Learning Distributions over Logical Forms for Referring Expression Generation
EMNLP 2013
Scaling Semantic Parsers with On-the-Fly Ontology Matching
EMNLP 2013
Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves
EMNLP 2013
Learning to Relate Literal and Sentimental Descriptions of Visual Properties
NAACL 2013
Paraphrase-Driven Learning for Open Question Answering
ACL 2013
Semantic Parsing with Combinatory Categorial Grammars
ACL 2013
Automatic Idiom Identification in Wiktionary
EMNLP 2013
A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings
EACL 2012
Discriminative Learning for Joint Template Filling
ACL 2012
Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations
ACL 2011
Bootstrapping Semantic Parsers from Conversations
EMNLP 2011
Lexical Generalization in CCG Grammar Induction for Semantic Parsing
EMNLP 2011
Reading between the Lines: Learning to Map High-Level Instructions to Commands
ACL 2010
Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification
EMNLP 2010
Reinforcement Learning for Mapping Instructions to Actions
ACL 2009
Learning Context-Dependent Mappings from Sentences to Logical Form
ACL 2009
Reinforcement Learning for Mapping Instructions to Actions
IJCNLP 2009
Learning Context-Dependent Mappings from Sentences to Logical Form
IJCNLP 2009
Multi-Agent Filtering with Infinitely Nested Beliefs
NIPS 2008
Selective Phrase Pair Extraction for Improved Statistical Machine Translation
NAACL 2007
Online Learning of Relaxed CCG Grammars for Parsing to Logical Form
CONLL 2007
Online Learning of Relaxed CCG Grammars for Parsing to Logical Form
EMNLP 2007