Doug Downey
51 papers · 2005–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🌍 Conference Polyglot (10) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🏃 Academic Marathon (20)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🧬
Topic Evolution
🏆
Keyword Champion
👥
Mega-Team
(23)
🗃️
Keyword Collector
(177)
⚡
Prolific Year
(6)
📈
Trend Setter
💎
Century Club
(50)
🔥
Unstoppable
(9)
❓
The Questioner
(3)
Conferences
EMNLP (18)
ACL (15)
NAACL (8)
CONLL (2)
EACL (2)
NIPS (2)
AAAI (1)
AISTATS (1)
ICLR (1)
IJCNLP (1)
Top co-authors
Keywords
language model
(11)
question answering
(5)
language modeling
(3)
information extraction
(3)
commonsense reasoning
(3)
word embedding
(3)
large language model
(3)
scientific literature
(3)
neural language model
(3)
scientific document
(2)
recurrent neural network
(2)
text classification
(2)
few-shot learning
(2)
retrieval-augmented generation
(2)
importance sampling
(2)
benchmark evaluation
(2)
probability distribution
(2)
commonsense knowledge
(2)
entity linking
(2)
transfer learning
(2)
Papers
Generating Literature-Driven Scientific Theories at Scale
ACL 2026
Ai2 Scholar QA: Organized Literature Synthesis with Attribution
ACL 2025
Intent-aware Schema Generation and Refinement for Literature Review Tables
EMNLP 2025
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
EMNLP 2025
ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews
ACL 2024
SciMON: Scientific Inspiration Machines Optimized for Novelty
ACL 2024
CARE: Extracting Experimental Findings From Clinical Literature
NAACL 2024
TOPICAL: TOPIC Pages AutomagicaLly
NAACL 2024
Penguins Don’t Fly: Reasoning about Generics through Instantiations and Exceptions
EACL 2023
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
EMNLP 2023
CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies
EMNLP 2023
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
EMNLP 2023
S2abEL: A Dataset for Entity Linking from Scientific Tables
EMNLP 2023
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation
ACL 2023
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents
ACL 2023
Embedding Recycling for Language Models
EACL 2023
Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities
NIPS 2022
Don’t Say What You Don’t Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search
EMNLP 2022
Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models
EMNLP 2022
ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts
EMNLP 2022
Few-Shot Self-Rationalization with Natural Language Prompts
NAACL 2022
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains
CONLL 2021
“It doesn’t look good for a date”: Transforming Critiques into Preferences for Conversational Recommendation Systems
EMNLP 2021
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains
EMNLP 2021
SPECTER: Document-level Representation Learning using Citation-informed Transformers
ACL 2020
Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks
ACL 2020
Stolen Probability: A Structural Weakness of Neural Language Models
ACL 2020
Abductive Commonsense Reasoning
ICLR 2020
Generative Data Augmentation for Commonsense Reasoning
EMNLP 2020
Just Add Functions: A Neural-Symbolic Language Model
AAAI 2020
A new evaluation framework for topic modeling algorithms based on synthetic corpora
AISTATS 2019
Using Large Corpus N-gram Statistics to Improve Recurrent Neural Language Models
NAACL 2019
CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense
NAACL 2019
Construction of the Literature Graph in Semantic Scholar
NAACL 2018
Estimating Marginal Probabilities of n-grams for Recurrent Neural Language Models
EMNLP 2018
Sampling Informative Training Data for RNN Language Models
ACL 2018
Extracting Commonsense Properties from Embeddings with Limited Human Guidance
ACL 2018
VecShare: A Framework for Sharing Word Representation Vectors
EMNLP 2017
Efficient Methods for Inferring Large Sparse Topic Hierarchies
ACL 2015
Efficient Methods for Incorporating Knowledge into Topic Models
EMNLP 2015
Efficient Methods for Inferring Large Sparse Topic Hierarchies
IJCNLP 2015
Adding High-Precision Links to Wikipedia
EMNLP 2014
Scaling Semi-supervised Naive Bayes with Feature Marginals
ACL 2013
Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text
NAACL 2013
Local and Global Algorithms for Disambiguation to Wikipedia
ACL 2011
Language Models as Representations for Weakly Supervised NLP Tasks
CONLL 2011
Improved Extraction Assessment through Better Language Models
NAACL 2010
It’s a Contradiction – no, it’s not: A Case Study using Functional Relations
EMNLP 2008
Look Ma, No Hands: Analyzing the Monotonic Feature Abstraction for Text Classification
NIPS 2008
Sparse Information Extraction: Unsupervised Language Models to the Rescue
ACL 2007
KnowItNow: Fast, Scalable Information Extraction from the Web
EMNLP 2005