William Yang Wang
230 papers · 2010–2026 · 16 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (27) · Keyword Pioneer · Interdisciplinary Bridge · Renaissance Researcher (7) · Hot Topic Early Bird
Academic Marathon (15)
Renaissance Researcher (7)
Interdisciplinary Bridge
Keyword Trendsetter Combo (6)
Conference Loyalist (28)
Dynamic Duo (29)
Triple Crown
Grand Slam
Mega-Team (71)
Deep Specialist (30)
Topic Evolution
Keyword Champion (3)
Unstoppable (16)
The Questioner (7)
Century Club (227)
Keyword Collector (72)
Prolific Year (26)
Trend Setter
Conference Pioneer
Conferences
ACL (56)
EMNLP (47)
NAACL (28)
ICLR (20)
NIPS (13)
IJCNLP (12)
EACL (11)
AAAI (10)
CVPR (8)
ICML (8)
COLING (4)
ECCV (4)
IJCAI (4)
ICCV (3)
AACL (1)
INTERSPEECH (1)
Research topics
Keywords
large language model (26)
text generation (19)
reinforcement learning (15)
question answering (14)
multimodal learning (14)
benchmark evaluation (11)
zero-shot learning (11)
language model (9)
few-shot learning (9)
neural network (9)
dialogue system (9)
text classification (8)
representation learning (8)
relation extraction (7)
natural language generation (7)
self-supervised learning (7)
natural language processing (7)
multi-modal learning (6)
variational autoencoder (6)
vision-language navigation (6)
Papers
LEDOM: Reverse Language Model
ACL 2026
Can Editing LLMs Inject Harm?
AAAI 2026
Detecting Training Data of Large Language Models via Expectation Maximization
EACL 2026
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
NAACL 2025
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
AntiLeakBench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
ACL 2025
Investigating the Transferability of Code Repair for Low-Resource Programming Languages
NAACL 2025
Scaling LLM Inference Efficiently with Optimized Sample Compute Allocation
NAACL 2025
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
ACL 2025
Disentangling Memory and Reasoning Ability in Large Language Models
ACL 2025
Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework
ACL 2025
InductionBench: LLMs Fail in the Simplest Complexity Class
ACL 2025
Gödel Agent: A Self-Referential Agent Framework for Recursively Self-Improvement
ACL 2025
TC-Bench: Benchmarking Temporal Compositionality in Conditional Video Generation
ACL 2025
REALM: A Dataset of Real-World LLM Use Cases
ACL 2025
Human Bias in the Face of AI: Examining Human Judgment Against Text Labeled as AI Generated
ACL 2025
Extrapolating to Unknown Opinions Using LLMs
COLING 2025
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
AAAI 2025
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
ICML 2025
Weak-to-Strong Jailbreaking on Large Language Models
ICML 2025
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
ICLR 2025
T2V-Turbo-v2: Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design
ICLR 2025
Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data
ICLR 2025
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement
ICLR 2025
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs
ICCV 2025
DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics
EMNLP 2025
Uncovering Factor-Level Preference to Improve Human-Model Alignment
EMNLP 2025
Dynamic Evaluation for Oversensitivity in LLMs
EMNLP 2025
Do You Know About My Nation? Investigating Multilingual Language Models’ Cultural Literacy Through Factual Knowledge
EMNLP 2025
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
EMNLP 2025
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
CVPR 2025
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
EMNLP 2024
AKEW: Assessing Knowledge Editing in the Wild
EMNLP 2024
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
ACL 2024
Hire a Linguist!: Learning Endangered Languages in LLMs with In-Context Linguistic Descriptions
ACL 2024
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models
ACL 2024
BPO: Staying Close to the Behavior LLM Creates Better Online LLM Alignment
EMNLP 2024
Guiding Instruction-based Image Editing via Multimodal Large Language Models
ICLR 2024
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
ICLR 2024
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models
ACL 2024
DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text
ICLR 2024
Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data
ICLR 2024
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering
EMNLP 2024
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback
NAACL 2024
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences
NIPS 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
NIPS 2024
FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model
NIPS 2024
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
AAAI 2024
Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
NAACL 2024
Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
NIPS 2024
Multimodal Procedural Planning via Dual Text-Image Prompting
EMNLP 2024
Position: AI/ML Influencers Have a Place in the Academic Process
ICML 2024
Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
ICML 2024
Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning
ICML 2024
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
A Survey on Detection of LLMs-Generated Content
EMNLP 2024
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate
EMNLP 2024
An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling
CVPR 2023
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text
NIPS 2023
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
NIPS 2023
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning
NIPS 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
NIPS 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
NIPS 2023
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
NIPS 2023
ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers
NIPS 2023
Attacking Open-domain Question Answering by Injecting Misinformation
AACL 2023
Multilingual Conceptual Coverage in Text-to-Image Models
ACL 2023
SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes
ACL 2023
Fact-Checking Complex Claims with Program-Guided Reasoning
ACL 2023
Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
ACL 2023
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
ACL 2023
RobustQA: Benchmarking the Robustness of Domain Adaptation for Open-Domain Question Answering
ACL 2023
Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
ACL 2023
Benchmarking Diverse-Modal Entity Linking with Generative Models
ACL 2023
Language Agnostic Multilingual Information Retrieval with Contrastive Learning
ACL 2023
Hybrid Hierarchical Retrieval for Open-Domain Question Answering
ACL 2023
Foveate, Attribute, and Rationalize: Towards Physically Safe and Trustworthy AI
ACL 2023
CausalDialogue: Modeling Utterance-level Causality in Conversations
ACL 2023
Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation
CVPR 2023
PECO: Examining Single Sentence Label Leakage in Natural Language Inference Datasets through Progressive Evaluation of Cluster Outliers
EACL 2023
Addressing Issues of Cross-Linguality in Open-Retrieval Question Answering Systems For Emergent Domains
EACL 2023
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
EACL 2023
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation
EACL 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
EACL 2023
Learning Concise and Descriptive Attributes for Visual Recognition
ICCV 2023
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
ICLR 2023
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
ICLR 2023
Causal Balancing for Domain Generalization
ICLR 2023
WikiWhy: Answering and Explaining Cause-and-Effect Questions
ICLR 2023
Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
ICLR 2023
Neuro-Symbolic Procedural Planning with Commonsense Prompting
ICLR 2023
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark
ICLR 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
ICML 2023
ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval
ICML 2023
NeuPSL: Neural Probabilistic Soft Logic
IJCAI 2023
Attacking Open-domain Question Answering by Injecting Misinformation
IJCNLP 2023
Data Augmentation for Diverse Voice Conversion in Noisy Environments
INTERSPEECH 2023
Language-Driven Artistic Style Transfer
ECCV 2022
M3L: Language-Based Video Editing via Multi-Modal Multi-Level Transformers
CVPR 2022
Mitigating Covertly Unsafe Text within Natural Language Systems
EMNLP 2022
FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue
EMNLP 2022
ULN: Towards Underspecified Vision-and-Language Navigation
EMNLP 2022
ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering
EMNLP 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
EMNLP 2022
SafeText: A Benchmark for Exploring Physical Safety in Language Models
EMNLP 2022
Imagination-Augmented Natural Language Understanding
NAACL 2022
D-REX: Dialogue Relation Extraction with Explanations
ACL 2022
End-to-end Dense Video Captioning as Sequence Generation
COLING 2022
Diagnosing Vision-and-Language Navigation: What Really Matters
NAACL 2022
Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer
AAAI 2022
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
AAAI 2022
Towards Large-Scale Interpretable Knowledge Graph Reasoning for Dialogue Systems
ACL 2022
HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data
ACL 2022
KETOD: Knowledge-Enriched Task-Oriented Dialogue
NAACL 2022
Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis
EMNLP 2022
Bridging the Training-Inference Gap for Dense Phrase Retrieval
EMNLP 2022
Unsupervised Multi-hop Question Answering by Question Generation
NAACL 2021
Counterfactual Maximum Likelihood Estimation for Training Deep Networks
NIPS 2021
Local Explanation of Dialogue Response Generation
NIPS 2021
Investigating Memorization of Conspiracy Theories in Text Generation
IJCNLP 2021
Zero-shot Fact Verification by Claim Generation
IJCNLP 2021
Neural Stylistic Response Generation with Disentangled Latent Variables
IJCNLP 2021
HULK: An Energy Efficiency Benchmark Platform for Responsible Natural Language Processing
EACL 2021
Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering
EACL 2021
Modeling Disclosive Transparency in NLP Application Descriptions
EMNLP 2021
FinQA: A Dataset of Numerical Reasoning over Financial Data
EMNLP 2021
A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space
EMNLP 2021
Open-Domain Question-Answering for COVID-19 and Other Emergent Domains
EMNLP 2021
Investigating Memorization of Conspiracy Theories in Text Generation
ACL 2021
Zero-shot Fact Verification by Claim Generation
ACL 2021
Neural Stylistic Response Generation with Disentangled Latent Variables
ACL 2021
On Hallucination and Predictive Uncertainty in Conditional Language Generation
EACL 2021
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals
EACL 2021
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
EACL 2021
Semi-Supervised Policy Initialization for Playing Games with Language Hints
NAACL 2021
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
ICLR 2021
Open Question Answering over Tables and Text
ICLR 2021
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler
ECCV 2020
Multi-Task Self-Supervised Learning for Disfluency Detection
AAAI 2020
Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs
AAAI 2020
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model
ICLR 2020
TabFact: A Large-scale Dataset for Table-based Fact Verification
ICLR 2020
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
CVPR 2020
Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment
ACL 2020
Logical Natural Language Generation from Open-Domain Tables
ACL 2020
HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data
EMNLP 2020
Logic2Text: High-Fidelity Natural Language Generation from Logical Forms
EMNLP 2020
On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond
ACL 2020
Towards Understanding Gender Bias in Relation Extraction
ACL 2020
Few-Shot NLG with Pre-Trained Language Model
ACL 2020
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
CVPR 2020
Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation
EMNLP 2020
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations
EMNLP 2020
KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation
EMNLP 2020
Investigating African-American Vernacular English in Transformer-Based Text Generation
EMNLP 2020
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
EMNLP 2020
Counterfactual Off-Policy Training for Neural Dialogue Generation
EMNLP 2020
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
ECCV 2020
Quantifying Uncertainties in Natural Language Processing Tasks
AAAI 2019
Deep Adversarial Learning for NLP
NAACL 2019
Neural Gaussian Copula for Variational Autoencoder
EMNLP 2019
Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification
EMNLP 2019
A Benchmark Dataset for Learning to Intervene in Online Hate Speech
EMNLP 2019
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
EMNLP 2019
Simple yet Effective Bridge Reasoning for Open-Domain Multi-Hop Question Answering
EMNLP 2019
What Should I Ask? Using Conversationally Informative Rewards for Goal-oriented Visual Dialog
ACL 2019
Towards Explainable NLP: A Generative Explanation Framework for Text Classification
ACL 2019
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
ICCV 2019
TWEETQA: A Social Media Focused Question Answering Dataset
ACL 2019
Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader
ACL 2019
Self-Supervised Dialogue Learning
ACL 2019
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
ACL 2019
Self-Supervised Learning for Contextualized Extractive Summarization
ACL 2019
Mitigating Gender Bias in Natural Language Processing: Literature Review
ACL 2019
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
AAAI 2019
Neural Gaussian Copula for Variational Autoencoder
IJCNLP 2019
Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification
IJCNLP 2019
A Benchmark Dataset for Learning to Intervene in Online Hate Speech
IJCNLP 2019
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
IJCNLP 2019
How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection
NAACL 2019
Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling
NAACL 2019
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing
NAACL 2019
Sentence Embedding Alignment for Lifelong Relation Extraction
NAACL 2019
Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation
NAACL 2019
Learning to Decipher Hate Symbols
NAACL 2019
DSGAN: Generative Adversarial Training for Distant Supervision Relation Extraction
ACL 2018
Multi-view Models for Political Ideology Detection of News Articles
EMNLP 2018
One-Shot Relational Learning for Knowledge Graphs
EMNLP 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
ACL 2018
Deep Reinforcement Learning for Chinese Zero Pronoun Resolution
ACL 2018
Variational Knowledge Graph Reasoning
NAACL 2018
Leveraging Intra-User and Inter-User Representation Learning for Automated Hate Speech Detection
NAACL 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
NAACL 2018
Scalable Construction and Reasoning of Massive Knowledge Bases
NAACL 2018
Zero Pronoun Resolution with Attention-based Neural Network
COLING 2018
Hierarchical CVAE for Fine-Grained Hate Speech Classification
EMNLP 2018
Video Captioning via Hierarchical Reinforcement Learning
CVPR 2018
MojiTalk: Generating Emotional Responses at Scale
ACL 2018
Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning
ACL 2018
Deep Reinforcement Learning for NLP
ACL 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
EMNLP 2018
Scheduled Policy Optimization for Natural Language Communication with Intelligent Agents
IJCAI 2018
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
ECCV 2018
Reinforced Co-Training
NAACL 2018
Simple Models for Word Formation in Slang
NAACL 2018
KBGAN: Adversarial Learning for Knowledge Graph Embeddings
NAACL 2018
Learning to Explain Non-Standard English Words and Phrases
IJCNLP 2017
Deep Residual Learning for Weakly-Supervised Relation Extraction
EMNLP 2017
“Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection
ACL 2017
DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning
EMNLP 2017
Learning First-Order Logic Embeddings via Matrix Factorization
IJCAI 2016
A Low-Rank Approximation Approach to Learning Joint Embeddings of News Stories and Images for Timeline Summarization
NAACL 2016
Scalable Statistical Relational Learning for NLP
NAACL 2016
Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach
IJCNLP 2015
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken Language Understanding
IJCNLP 2015
That’s So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets
EMNLP 2015
I Can Has Cheezburger? A Nonparanormal Approach to Combining Textual and Visual Information for Predicting and Generating Popular Meme Descriptions
NAACL 2015
Jointly Modeling Inter-Slot Relations by Random Walk on Knowledge Graphs for Unsupervised Spoken Language Understanding
NAACL 2015
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken Language Understanding
ACL 2015
Joint Information Extraction and Reasoning: A Scalable Statistical Relational Learning Approach
ACL 2015
A Soft Version of Predicate Invention Based on Structured Sparsity
IJCAI 2015
Dependency Parsing for Weibo: An Efficient Probabilistic Logic Programming Approach
EMNLP 2014
A Semiparametric Gaussian Copula Regression Model for Predicting Financial Risks from Earnings Calls
ACL 2014
This Text Has the Scent of Starbucks: A Laplacian Structured Sparsity Model for Computational Branding Analytics
EMNLP 2013
Automatic Domain Partitioning for Multi-Domain Learning
EMNLP 2013
Historical Analysis of Legal Opinions with a Sparse Mixed-Effects Latent Variable Model
ACL 2012
Identifying Event Descriptions using Co-training with Online News Summaries
IJCNLP 2011
“Got You!”: Automatic Vandalism Detection in Wikipedia with Web-based Shallow Syntactic-Semantic Modeling
COLING 2010