Luke Zettlemoyer

235 papers · 2007–2025 · 15 conferences · across top CS/AI conferences

Achievements

+20 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (28) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (15)

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (14) 🏠 Conference Loyalist (70) 🧬 Topic Evolution 🤝 Dynamic Duo (38) 🏆 Grand Slam 👥 Mega-Team (60) 👑 Triple Crown 🌱 Topic Pioneer 🔬 Deep Specialist (20) 🏆 Keyword Champion 🗃️ Keyword Collector (70) 📈 Trend Setter 💎 Century Club (235) 🔥 Unstoppable (19) ❓ The Questioner (4) 🚀 Conference Pioneer ⚡ Prolific Year (34)

Conferences

EMNLP (70) ACL (63) ICLR (27) IJCNLP (18) NAACL (17) NIPS (17) ICML (7) CVPR (4) CORL (3) EACL (3) CONLL (2) AAAI (1) COLING (1) ICCV (1) IJCAI (1)

Top co-authors

Mike Lewis (38) Hannaneh Hajishirzi (26) Weijia Shi (22) Omer Levy (19) Sewon Min (19) Wen-tau Yih (17) Marjan Ghazvininejad (16) Noah A. Smith (15) Armen Aghajanyan (13) Terra Blevins (13)

Keywords

language model (21) large language model (21) zero-shot learning (18) question answering (15) representation learning (13) few-shot learning (12) semantic parsing (12) in-context learning (11) transfer learning (11) neural network (10) multilingual language model (9) cross-lingual transfer (9) code generation (8) language modeling (8) coreference resolution (8) text generation (7) distant supervision (7) machine translation (7) masked language model (7) semantic role labeling (7)

Papers

DreamGen: Unlocking Generalization in Robot Learning through Video World Models CORL 2025 Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models NAACL 2025 Memory Layers at Scale ICML 2025 Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass ICLR 2025 (Mis)Fitting Scaling Laws: A Survey of Scaling Law Fitting Techniques in Deep Learning ICLR 2025 MUSE: Machine Unlearning Six-Way Evaluation for Language Models ICLR 2025 Fantastic Copyrighted Beasts and How (Not) to Generate Them ICLR 2025 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model ICLR 2025 Byte Latent Transformer: Patches Scale Better Than Tokens ACL 2025 Improving Factuality with Explicit Working Memory ACL 2025 Latent Action Pretraining from Videos ICLR 2025 MULTIGUARD: An Efficient Approach for AI Safety Moderation Across Languages and Modalities EMNLP 2025 s1: Simple test-time scaling EMNLP 2025 Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length NIPS 2024 MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling ACL 2024 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation EMNLP 2024 Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models EACL 2024 Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models EMNLP 2024 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research ACL 2024 OLMo: Accelerating the Science of Language Models ACL 2024 In-Context Pretraining: Language Modeling Beyond Document Boundaries ICLR 2024 MoDE: CLIP Data Experts via Clustering CVPR 2024 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore NIPS 2024 Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models NIPS 2024 Evaluating Copyright Takedown Methods for Language Models NIPS 2024 DataComp-LM: In search of the next generation of training sets for language models NIPS 2024 Better Alignment with Instruction Back-and-Forth Translation EMNLP 2024 Detecting Pretraining Data from Large Language Models ICLR 2024 Altogether: Image Captioning via Re-aligning Alt-text EMNLP 2024 The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants ACL 2024 Trusting Your Evidence: Hallucinate Less with Context-aware Decoding NAACL 2024 REPLUG: Retrieval-Augmented Black-Box Language Models NAACL 2024 Demystifying CLIP Data ICLR 2024 RA-DIT: Retrieval-Augmented Dual Instruction Tuning ICLR 2024 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore ICLR 2024 Representation Deficiency in Masked Language Modeling ICLR 2024 Self-Alignment with Instruction Backtranslation ICLR 2024 Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters ACL 2023 CREPE: Open-Domain Question Answering with False Presuppositions ACL 2023 Contrastive Decoding: Open-ended Text Generation as Optimization ACL 2023 Training Trajectories of Language Models Across Scales ACL 2023 One Embedder, Any Task: Instruction-Finetuned Text Embeddings ACL 2023 Nonparametric Masked Language Modeling ACL 2023 In-context Examples Selection for Machine Translation ACL 2023 Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI AAAI 2023 CiT: Curation in Training for Effective Vision-Language Data ICCV 2023 The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages EMNLP 2023 Toward Human Readable Prompt Tuning: Kubrick’s The Shining is a good movie, and a good prompt too? EMNLP 2023 Demystifying Prompts in Language Models via Perplexity Estimation EMNLP 2023 Getting MoRE out of Mixture of Language Model Reasoning Experts EMNLP 2023 RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering EMNLP 2023 XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models EMNLP 2023 FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation EMNLP 2023 Revisiting Machine Translation for Cross-lingual Classification EMNLP 2023 MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers NIPS 2023 AGRO: Adversarial discovery of error-prone Groups for Robust Optimization ICLR 2023 Mega: Moving Average Equipped Gated Attention ICLR 2023 Selective Annotation Makes Language Models Better Few-Shot Learners ICLR 2023 QLoRA: Efficient Finetuning of Quantized LLMs NIPS 2023 Toolformer: Language Models Can Teach Themselves to Use Tools NIPS 2023 LIMA: Less Is More for Alignment NIPS 2023 Stable and low-precision training for large-scale vision-language models NIPS 2023 Scaling Laws for Generative Mixed-Modal Language Models ICML 2023 The case for 4-bit precision: k-bit Inference Scaling Laws ICML 2023 DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation ICML 2023 Retrieval-Augmented Multimodal Language Modeling ICML 2023 Binding Language Models in Symbolic Languages ICLR 2023 ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning ICLR 2023 InCoder: A Generative Model for Code Infilling and Synthesis ICLR 2023 Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations ACL 2023 Prompting Language Models for Linguistic Structure ACL 2023 Few-shot Learning with Multilingual Generative Language Models EMNLP 2022 HTLM: Hyper-Text Pre-Training and Prompting of Language Models ICLR 2022 8-bit Optimizers via Block-wise Quantization ICLR 2022 BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation NAACL 2022 DEMix Layers: Disentangling Domains for Modular Language Modeling NAACL 2022 Quantifying Adaptability in Pre-trained Language Models with 500 Tasks NAACL 2022 MetaICL: Learning to Learn In Context NAACL 2022 Improving Policy Learning via Language Dynamics Distillation NIPS 2022 GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale NIPS 2022 Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models NIPS 2022 Prompt-free and Efficient Few-shot Learning with Language Models ACL 2022 FaVIQ: FAct Verification from Information-seeking Questions ACL 2022 Noisy Channel Language Model Prompting for Few-Shot Text Classification ACL 2022 Question Answering Infused Pre-training of General-Purpose Contextualized Representations ACL 2022 UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models EMNLP 2022 M2D2: A Massively Multi-Domain Language Modeling Dataset EMNLP 2022 Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection EMNLP 2022 Nearest Neighbor Zero-Shot Inference EMNLP 2022 Natural Language to Code Translation with Execution EMNLP 2022 Language Contamination Helps Explains the Cross-lingual Capabilities of English Pretrained Models EMNLP 2022 Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models EMNLP 2022 Improving Passage Retrieval with Zero-Shot Question Generation EMNLP 2022 Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? EMNLP 2022 Efficient Large Scale Language Modeling with Mixtures of Experts EMNLP 2022 CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation EMNLP 2022 On the Role of Bidirectionality in Language Model Pre-Training EMNLP 2022 Better Fine-Tuning by Reducing Representational Collapse ICLR 2021 Language Grounding with 3D Objects CORL 2021 Inducing Semantic Roles Without Syntax IJCNLP 2021 VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding IJCNLP 2021 Prompting Contrastive Explanations for Commonsense Reasoning Tasks IJCNLP 2021 Detecting Hallucinated Content in Conditional Neural Sequence Generation ACL 2021 DESCGEN: A Distantly Supervised Datasetfor Generating Entity Descriptions ACL 2021 Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning ACL 2021 Prompting Contrastive Explanations for Commonsense Reasoning Tasks ACL 2021 Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment ACL 2021 VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding ACL 2021 Muppet: Massive Multi-task Representations with Pre-Finetuning EMNLP 2021 VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding EMNLP 2021 Surface Form Competition: Why the Highest Probability Answer Isn’t Always Right EMNLP 2021 Detecting Hallucinated Content in Conditional Neural Sequence Generation IJCNLP 2021 Inducing Semantic Roles Without Syntax ACL 2021 FEWS: Large-Scale, Low-Shot Word Sense Disambiguation with the Dictionary EACL 2021 Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning IJCNLP 2021 Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment IJCNLP 2021 DESCGEN: A Distantly Supervised Datasetfor Generating Entity Descriptions IJCNLP 2021 SILG: The Multi-domain Symbolic Interactive Language Grounding Benchmark NIPS 2021 Luna: Linear Unified Nested Attention NIPS 2021 BASE Layers: Simplifying Training of Large, Sparse Models ICML 2021 Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing ICLR 2021 DeLighT: Deep and Light-weight Transformer ICLR 2021 Nearest Neighbor Machine Translation ICLR 2021 Scalable Zero-shot Entity Linking with Dense Entity Retrieval EMNLP 2020 Aligned Cross Entropy for Non-Autoregressive Machine Translation ICML 2020 Unsupervised Cross-lingual Representation Learning at Scale ACL 2020 Active Learning for Coreference Resolution using Discrete Annotation ACL 2020 BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension ACL 2020 Controlled Crowdsourcing for High-Quality QA-SRL Annotation ACL 2020 Emerging Cross-lingual Structure in Pretrained Language Models ACL 2020 Simple and Effective Retrieve-Edit-Rerank Text Generation ACL 2020 Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders ACL 2020 Pre-training via Paraphrasing NIPS 2020 An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction EMNLP 2020 Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing EMNLP 2020 AmbigQA: Answering Ambiguous Open-domain Questions EMNLP 2020 Grounded Adaptation for Zero-shot Executable Semantic Parsing EMNLP 2020 Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles EMNLP 2020 Generalization through Memorization: Nearest Neighbor Language Models ICLR 2020 ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks CVPR 2020 QANom: Question-Answer driven SRL for Nominalizations COLING 2020 Iterative Search for Weakly Supervised Semantic Parsing NAACL 2019 pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference NAACL 2019 Learning Programmatic Idioms for Scalable Semantic Parsing EMNLP 2019 Vision-and-Dialog Navigation CORL 2019 A Discrete Hard EM Approach for Weakly Supervised Question Answering IJCNLP 2019 Cloze-driven Pretraining of Self-attention Networks EMNLP 2019 Don’t Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases EMNLP 2019 A Discrete Hard EM Approach for Weakly Supervised Question Answering EMNLP 2019 Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog EMNLP 2019 Mask-Predict: Parallel Decoding of Conditional Masked Language Models IJCNLP 2019 Better Character Language Modeling through Morphology ACL 2019 Evaluating Gender Bias in Machine Translation ACL 2019 E3: Entailment-driven Extracting and Editing for Conversational Machine Reading ACL 2019 BERT for Coreference Resolution: Baselines and Analysis IJCNLP 2019 JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation IJCNLP 2019 Learning Programmatic Idioms for Scalable Semantic Parsing IJCNLP 2019 Cloze-driven Pretraining of Self-attention Networks IJCNLP 2019 Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog IJCNLP 2019 Multi-hop Reading Comprehension through Question Decomposition and Rescoring ACL 2019 Don’t Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases IJCNLP 2019 Compositional Questions Do Not Necessitate Multi-hop Reasoning ACL 2019 The Referential Reader: A Recurrent Entity Network for Anaphora Resolution ACL 2019 Mask-Predict: Parallel Decoding of Conditional Masked Language Models EMNLP 2019 BERT for Coreference Resolution: Baselines and Analysis EMNLP 2019 JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation EMNLP 2019 Mapping Language to Code in Programmatic Context EMNLP 2018 Ultra-Fine Entity Typing ACL 2018 Large-Scale QA-SRL Parsing ACL 2018 Deep RNNs Encode Soft Hierarchical Syntax ACL 2018 Jointly Predicting Predicates and Arguments in Neural Semantic Role Labeling ACL 2018 Long Short-Term Memory as a Dynamically Computed Element-wise Weighted Sum ACL 2018 Neural Semantic Parsing ACL 2018 AllenNLP: A Deep Semantic Natural Language Processing Platform ACL 2018 SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach EMNLP 2018 Neural Metaphor Detection in Context EMNLP 2018 Dissecting Contextual Word Embeddings: Architecture and Representation EMNLP 2018 QuAC: Question Answering in Context EMNLP 2018 Syntactic Scaffolds for Semantic Structures EMNLP 2018 Deep contextualized word representations ICLR 2018 Supervised Open Information Extraction NAACL 2018 Adversarial Example Generation with Syntactically Controlled Paraphrase Networks NAACL 2018 Deep Contextualized Word Representations NAACL 2018 Crowdsourcing Question-Answer Meaning Representations NAACL 2018 Higher-Order Coreference Resolution with Coarse-to-Fine Inference NAACL 2018 Zero-Shot Relation Extraction via Reading Comprehension CONLL 2017 TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension ACL 2017 End-to-end Neural Coreference Resolution EMNLP 2017 Neural AMR: Sequence-to-Sequence Models for Parsing and Generation ACL 2017 Deep Semantic Role Labeling: What Works and What’s Next ACL 2017 Learning a Neural Semantic Parser from User Feedback ACL 2017 Commonly Uncommon: Semantic Sparsity in Situation Recognition CVPR 2017 Globally Coherent Text Generation with Neural Checklist Models EMNLP 2016 A Theme-Rewriting Approach for Generating Algebra Word Problems EMNLP 2016 Human-in-the-Loop Parsing EMNLP 2016 Global Neural CCG Parsing with Optimality Guarantees EMNLP 2016 Situation Recognition: Visual Semantic Role Labeling for Image Understanding CVPR 2016 LSTM CCG Parsing NAACL 2016 Document-level Sentiment Inference with Social, Faction, and Discourse Context ACL 2016 Summarizing Source Code using a Neural Attention Model ACL 2016 Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language EMNLP 2015 Broad-coverage CCG Semantic Parsing with AMR EMNLP 2015 Scalable Semantic Parsing with Partial Ontologies IJCNLP 2015 Event Detection and Factuality Assessment with Non-Expert Supervision EMNLP 2015 Joint A* CCG Parsing and Semantic Role Labelling EMNLP 2015 Mise en Place: Unsupervised Interpretation of Instructional Recipes EMNLP 2015 Scalable Semantic Parsing with Partial Ontologies ACL 2015 Personalized Mathematical Word Problem Generation IJCAI 2015 Semantic Parsing with Combinatory Categorial Grammars EMNLP 2014 Morpho-syntactic Lexical Generalization for CCG Semantic Parsing EMNLP 2014 Context-dependent Semantic Parsing for Time Expressions ACL 2014 Learning to Automatically Solve Algebra Word Problems ACL 2014 Lightly Supervised Learning of Procedural Dialog Systems ACL 2013 Learning Distributions over Logical Forms for Referring Expression Generation EMNLP 2013 Scaling Semantic Parsers with On-the-Fly Ontology Matching EMNLP 2013 Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves EMNLP 2013 Learning to Relate Literal and Sentimental Descriptions of Visual Properties NAACL 2013 Paraphrase-Driven Learning for Open Question Answering ACL 2013 Semantic Parsing with Combinatory Categorial Grammars ACL 2013 Automatic Idiom Identification in Wiktionary EMNLP 2013 A Probabilistic Model of Syntactic and Semantic Acquisition from Child-Directed Utterances and their Meanings EACL 2012 Discriminative Learning for Joint Template Filling ACL 2012 Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations ACL 2011 Bootstrapping Semantic Parsers from Conversations EMNLP 2011 Lexical Generalization in CCG Grammar Induction for Semantic Parsing EMNLP 2011 Reading between the Lines: Learning to Map High-Level Instructions to Commands ACL 2010 Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification EMNLP 2010 Reinforcement Learning for Mapping Instructions to Actions ACL 2009 Learning Context-Dependent Mappings from Sentences to Logical Form ACL 2009 Reinforcement Learning for Mapping Instructions to Actions IJCNLP 2009 Learning Context-Dependent Mappings from Sentences to Logical Form IJCNLP 2009 Multi-Agent Filtering with Infinitely Nested Beliefs NIPS 2008 Selective Phrase Pair Extraction for Improved Statistical Machine Translation NAACL 2007 Online Learning of Relaxed CCG Grammars for Parsing to Logical Form CONLL 2007 Online Learning of Relaxed CCG Grammars for Parsing to Logical Form EMNLP 2007