conftrace_

Wen-tau Yih

100 papers · 2002–2026 · 12 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+18 more ↓

🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (12)

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (23) 🗺️ Taxonomy Completionist (13) 🏠 Conference Loyalist (30) 🌟 Keyword Trendsetter Combo (13) 🤝 Dynamic Duo (17) 🌱 Topic Pioneer 🏆 Keyword Champion (2) 🏆 Grand Slam 👥 Mega-Team (27) 🔬 Deep Specialist (22) 🔥 Unstoppable (17) 🗃️ Keyword Collector (266) 🚀 Conference Pioneer ⚡ Prolific Year (10) ❓ The Questioner (2) 💎 Century Club (99) 📈 Trend Setter

Conferences

EMNLP (30) ACL (25) NAACL (13) ICML (6) IJCNLP (6) CONLL (5) ICLR (4) NIPS (4) COLING (3) EACL (2) AAAI (1) CVPR (1)

Top co-authors

Luke Zettlemoyer (17) Mike Lewis (12) Xilun Chen (11) Barlas Oguz (11) Xi Victoria Lin (10) Weijia Shi (9) Ming-Wei Chang (8) Christopher Meek (7) Sewon Min (7) Xiaodong He (7)

Research topics

Keywords

question answering (17) semantic parsing (9) large language model (8) information retrieval (8) retrieval-augmented generation (8) open-domain question answering (7) few-shot learning (6) dense retrieval (6) contrastive learning (5) zero-shot learning (5) language model (5) procedural text (5) multi-task learning (4) instruction tuning (4) knowledge-intensive task (3) sparse retrieval (3) transfer learning (3) domain adaptation (3) natural language processing (3) fact verification (3)

Papers

Knowledge Extraction on Semi-Structured Content: Does It Remain Relevant for Question Answering in the Era of LLMs? EACL 2026 DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers ACL 2025 Improving Factuality with Explicit Working Memory ACL 2025 Memory Layers at Scale ICML 2025 ImpRAG: Retrieval-Augmented Generation with Implicit Queries EMNLP 2025 SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models ICML 2025 PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning EMNLP 2025 In-Context Pretraining: Language Modeling Beyond Document Boundaries ICLR 2024 Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering EACL 2024 MoDE: CLIP Data Experts via Clustering CVPR 2024 Nearest Neighbor Speculative Decoding for LLM Generation and Attribution NIPS 2024 FLAME : Factuality-Aware Alignment for Large Language Models NIPS 2024 Trusting Your Evidence: Hallucinate Less with Context-aware Decoding NAACL 2024 REPLUG: Retrieval-Augmented Black-Box Language Models NAACL 2024 RA-DIT: Retrieval-Augmented Dual Instruction Tuning ICLR 2024 Altogether: Image Captioning via Re-aligning Alt-text EMNLP 2024 CRAG - Comprehensive RAG Benchmark NIPS 2024 Instruction-tuned Language Models are Better Knowledge Learners ACL 2024 RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering EMNLP 2023 How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval EMNLP 2023 Coder Reviewer Reranking for Code Generation ICML 2023 Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing ACL 2023 CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval ACL 2023 One Embedder, Any Task: Instruction-Finetuned Text Embeddings ACL 2023 Nonparametric Masked Language Modeling ACL 2023 Task-aware Retrieval with Instructions ACL 2023 Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering ACL 2023 FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation EMNLP 2023 DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation ICML 2023 LEVER: Learning to Verify Language-to-Code Generation with Execution ICML 2023 Retrieval-Augmented Multimodal Language Modeling ICML 2023 Improving Passage Retrieval with Zero-Shot Question Generation EMNLP 2022 On Continual Model Refinement in Out-of-Distribution Data Streams ACL 2022 Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One? EMNLP 2022 On Unifying Misinformation Detection NAACL 2021 Multi-Task Retrieval for Knowledge-Intensive Tasks ACL 2021 RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering NAACL 2021 On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study ACL 2021 Joint Verification and Reranking for Open Fact Checking Over Tables ACL 2021 On the Influence of Masking Policies in Intermediate Pre-training EMNLP 2021 Joint Verification and Reranking for Open Fact Checking Over Tables IJCNLP 2021 On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study IJCNLP 2021 Multi-Task Retrieval for Knowledge-Intensive Tasks IJCNLP 2021 Open-Domain Question Answering ACL 2020 TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data ACL 2020 Efficient One-Pass End-to-End Entity Linking for Questions EMNLP 2020 Dense Passage Retrieval for Open-Domain Question Answering EMNLP 2020 An Imitation Game for Learning Semantic Parsers from User Interaction EMNLP 2020 Unsupervised Question Decomposition for Question Answering EMNLP 2020 Blockwise Self-Attention for Long Document Understanding EMNLP 2020 Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks NIPS 2020 Abductive Commonsense Reasoning ICLR 2020 Language Models as Fact Checkers? ACL 2020 Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text IJCNLP 2019 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study IJCNLP 2019 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study EMNLP 2019 Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text EMNLP 2019 Be Consistent! Improving Procedural Text Comprehension using Label Consistency NAACL 2019 QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships AAAI 2019 FlowQA: Grasping Flow in History for Conversational Machine Comprehension ICLR 2019 Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations EMNLP 2018 Tracking State Changes in Procedural Text: a Challenge Dataset and Models for Process Paragraph Comprehension NAACL 2018 Reasoning about Actions and State Changes by Injecting Commonsense Knowledge EMNLP 2018 Dissecting Contextual Word Embeddings: Architecture and Representation EMNLP 2018 QuAC: Question Answering in Context EMNLP 2018 Natural Language to Structured Query Generation via Meta-Learning NAACL 2018 NLP for Precision Medicine ACL 2017 Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision EMNLP 2017 Search-based Neural Structured Learning for Sequential Question Answering ACL 2017 Question Answering with Knowledge Base, Web and Beyond NAACL 2016 Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems EMNLP 2016 The Value of Semantic Parse Labeling for Knowledge Base Question Answering ACL 2016 Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text ACL 2016 Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base ACL 2015 WikiQA: A Challenge Dataset for Open-Domain Question Answering EMNLP 2015 Deep Learning and Continuous Representations for Natural Language Processing NAACL 2015 Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base IJCNLP 2015 Typed Tensor Decomposition of Knowledge Bases for Relation Extraction EMNLP 2014 Semantic Parsing for Single-Relation Question Answering ACL 2014 Learning Continuous Phrase Representations for Translation Modeling ACL 2014 Multi-Relational Latent Semantic Analysis EMNLP 2013 Animacy Detection with Voting Models EMNLP 2013 Question Answering Using Enhanced Lexical Semantic Models ACL 2013 Linguistic Regularities in Continuous Space Word Representations NAACL 2013 Combining Heterogeneous Models for Measuring Relational Similarity NAACL 2013 Polarity Inducing Latent Semantic Analysis CONLL 2012 Polarity Inducing Latent Semantic Analysis EMNLP 2012 Measuring Word Relatedness Using Heterogeneous Vector Space Models NAACL 2012 MSR SPLAT, a language analysis toolkit NAACL 2012 Learning Discriminative Projections for Text Similarity Measures CONLL 2011 Translingual Document Representations from Discriminative Projections EMNLP 2010 Learning Term-weighting Functions for Similarity Measures EMNLP 2009 Improved Discriminative Bilingual Word Alignment ACL 2006 Improved Discriminative Bilingual Word Alignment COLING 2006 Generalized Inference with Multiple Semantic Role Labeling Systems CONLL 2005 Demonstrating an Interactive Semantic Role Labeling System EMNLP 2005 Semantic Role Labeling Via Integer Linear Programming Inference COLING 2004 Semantic Role Labeling Via Generalized Inference Over Classifiers CONLL 2004 A Linear Programming Formulation for Global Inference in Natural Language Tasks CONLL 2004 Probabilistic Reasoning for Entity & Relation Recognition COLING 2002