Wen-tau Yih
100 papers · 2002–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (12)
π
Interdisciplinary Bridge
π
Academic Marathon
(23)
πΊοΈ
Taxonomy Completionist
(13)
π
Conference Loyalist
(30)
π
Keyword Trendsetter Combo
(13)
π€
Dynamic Duo
(17)
π±
Topic Pioneer
π
Keyword Champion
(2)
π
Grand Slam
π₯
Mega-Team
(27)
π¬
Deep Specialist
(22)
π₯
Unstoppable
(17)
ποΈ
Keyword Collector
(266)
π
Conference Pioneer
β‘
Prolific Year
(10)
β
The Questioner
(2)
π
Century Club
(99)
π
Trend Setter
Conferences
EMNLP (30)
ACL (25)
NAACL (13)
ICML (6)
IJCNLP (6)
CONLL (5)
ICLR (4)
NIPS (4)
COLING (3)
EACL (2)
AAAI (1)
CVPR (1)
Top co-authors
Research topics
Keywords
question answering
(17)
semantic parsing
(9)
large language model
(8)
information retrieval
(8)
retrieval-augmented generation
(8)
open-domain question answering
(7)
few-shot learning
(6)
dense retrieval
(6)
contrastive learning
(5)
zero-shot learning
(5)
language model
(5)
procedural text
(5)
multi-task learning
(4)
instruction tuning
(4)
knowledge-intensive task
(3)
sparse retrieval
(3)
transfer learning
(3)
domain adaptation
(3)
natural language processing
(3)
fact verification
(3)
Papers
Knowledge Extraction on Semi-Structured Content: Does It Remain Relevant for Question Answering in the Era of LLMs?
EACL 2026
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers
ACL 2025
Improving Factuality with Explicit Working Memory
ACL 2025
Memory Layers at Scale
ICML 2025
ImpRAG: Retrieval-Augmented Generation with Implicit Queries
EMNLP 2025
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
ICML 2025
PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning
EMNLP 2025
In-Context Pretraining: Language Modeling Beyond Document Boundaries
ICLR 2024
Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering
EACL 2024
MoDE: CLIP Data Experts via Clustering
CVPR 2024
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
NIPS 2024
FLAME : Factuality-Aware Alignment for Large Language Models
NIPS 2024
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding
NAACL 2024
REPLUG: Retrieval-Augmented Black-Box Language Models
NAACL 2024
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
ICLR 2024
Altogether: Image Captioning via Re-aligning Alt-text
EMNLP 2024
CRAG - Comprehensive RAG Benchmark
NIPS 2024
Instruction-tuned Language Models are Better Knowledge Learners
ACL 2024
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
EMNLP 2023
How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval
EMNLP 2023
Coder Reviewer Reranking for Code Generation
ICML 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
ACL 2023
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval
ACL 2023
One Embedder, Any Task: Instruction-Finetuned Text Embeddings
ACL 2023
Nonparametric Masked Language Modeling
ACL 2023
Task-aware Retrieval with Instructions
ACL 2023
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
ACL 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
EMNLP 2023
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation
ICML 2023
LEVER: Learning to Verify Language-to-Code Generation with Execution
ICML 2023
Retrieval-Augmented Multimodal Language Modeling
ICML 2023
Improving Passage Retrieval with Zero-Shot Question Generation
EMNLP 2022
On Continual Model Refinement in Out-of-Distribution Data Streams
ACL 2022
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?
EMNLP 2022
On Unifying Misinformation Detection
NAACL 2021
Multi-Task Retrieval for Knowledge-Intensive Tasks
ACL 2021
RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering
NAACL 2021
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study
ACL 2021
Joint Verification and Reranking for Open Fact Checking Over Tables
ACL 2021
On the Influence of Masking Policies in Intermediate Pre-training
EMNLP 2021
Joint Verification and Reranking for Open Fact Checking Over Tables
IJCNLP 2021
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study
IJCNLP 2021
Multi-Task Retrieval for Knowledge-Intensive Tasks
IJCNLP 2021
Open-Domain Question Answering
ACL 2020
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
ACL 2020
Efficient One-Pass End-to-End Entity Linking for Questions
EMNLP 2020
Dense Passage Retrieval for Open-Domain Question Answering
EMNLP 2020
An Imitation Game for Learning Semantic Parsers from User Interaction
EMNLP 2020
Unsupervised Question Decomposition for Question Answering
EMNLP 2020
Blockwise Self-Attention for Long Document Understanding
EMNLP 2020
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
NIPS 2020
Abductive Commonsense Reasoning
ICLR 2020
Language Models as Fact Checkers?
ACL 2020
Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text
IJCNLP 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
IJCNLP 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
EMNLP 2019
Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text
EMNLP 2019
Be Consistent! Improving Procedural Text Comprehension using Label Consistency
NAACL 2019
QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships
AAAI 2019
FlowQA: Grasping Flow in History for Conversational Machine Comprehension
ICLR 2019
Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations
EMNLP 2018
Tracking State Changes in Procedural Text: a Challenge Dataset and Models for Process Paragraph Comprehension
NAACL 2018
Reasoning about Actions and State Changes by Injecting Commonsense Knowledge
EMNLP 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
EMNLP 2018
QuAC: Question Answering in Context
EMNLP 2018
Natural Language to Structured Query Generation via Meta-Learning
NAACL 2018
NLP for Precision Medicine
ACL 2017
Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision
EMNLP 2017
Search-based Neural Structured Learning for Sequential Question Answering
ACL 2017
Question Answering with Knowledge Base, Web and Beyond
NAACL 2016
Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems
EMNLP 2016
The Value of Semantic Parse Labeling for Knowledge Base Question Answering
ACL 2016
Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text
ACL 2016
Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base
ACL 2015
WikiQA: A Challenge Dataset for Open-Domain Question Answering
EMNLP 2015
Deep Learning and Continuous Representations for Natural Language Processing
NAACL 2015
Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base
IJCNLP 2015
Typed Tensor Decomposition of Knowledge Bases for Relation Extraction
EMNLP 2014
Semantic Parsing for Single-Relation Question Answering
ACL 2014
Learning Continuous Phrase Representations for Translation Modeling
ACL 2014
Multi-Relational Latent Semantic Analysis
EMNLP 2013
Animacy Detection with Voting Models
EMNLP 2013
Question Answering Using Enhanced Lexical Semantic Models
ACL 2013
Linguistic Regularities in Continuous Space Word Representations
NAACL 2013
Combining Heterogeneous Models for Measuring Relational Similarity
NAACL 2013
Polarity Inducing Latent Semantic Analysis
CONLL 2012
Polarity Inducing Latent Semantic Analysis
EMNLP 2012
Measuring Word Relatedness Using Heterogeneous Vector Space Models
NAACL 2012
MSR SPLAT, a language analysis toolkit
NAACL 2012
Learning Discriminative Projections for Text Similarity Measures
CONLL 2011
Translingual Document Representations from Discriminative Projections
EMNLP 2010
Learning Term-weighting Functions for Similarity Measures
EMNLP 2009
Improved Discriminative Bilingual Word Alignment
ACL 2006
Improved Discriminative Bilingual Word Alignment
COLING 2006
Generalized Inference with Multiple Semantic Role Labeling Systems
CONLL 2005
Demonstrating an Interactive Semantic Role Labeling System
EMNLP 2005
Semantic Role Labeling Via Integer Linear Programming Inference
COLING 2004
Semantic Role Labeling Via Generalized Inference Over Classifiers
CONLL 2004
A Linear Programming Formulation for Global Inference in Natural Language Tasks
CONLL 2004
Probabilistic Reasoning for Entity & Relation Recognition
COLING 2002