Yao Liu

37 papers · 2016–2026 · 13 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🌍 Conference Polyglot (13) 🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 👥 Mega-Team (34) 🏆 Grand Slam 🧬 Topic Evolution 🗃️ Keyword Collector (168) ⚡ Prolific Year (5) 🚀 Conference Pioneer 💎 Century Club (33) 🔥 Unstoppable (8) 📈 Trend Setter ❓ The Questioner

Conferences

AAAI (6) EMNLP (6) ICML (5) NIPS (5) ACL (4) ICLR (2) IJCAI (2) UAI (2) AACL (1) ACML (1) CORL (1) IJCNLP (1) NSDI (1)

Top co-authors

Emma Brunskill (7) Rasool Fakoor (6) Xiang Li (5) Omer Gottesman (4) Jiapeng Zhu (4) Yao Cheng (4) Qiao Liu (3) Julian McAuley (3) Jianxiang Yu (3) Kavosh Asadi (3)

Keywords

large language model (6) reinforcement learning (5) importance sampling (4) graph retrieval (3) markov decision process (3) off-policy evaluation (3) knowledge distillation (3) batch reinforcement learning (3) knowledge graph (3) instruction tuning (3) review comment understanding (2) peer review (2) multi-hop reasoning (2) semantic mind graph (2) policy optimization (2) hierarchical background graph (2) few-shot learning (1) iterative optimization (1) semi-supervised learning (1) sparse recovery (1)

Papers

Exploiting Inter-Session Information with Frequency-enhanced Dual-Path Networks for Sequential Recommendation AAAI 2026 Why Do Emotions Change? Appraisal-Guided Reasoning for Emotion–Cause Triplet Extraction in Conversations ACL 2026 Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving AAAI 2026 MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence AAAI 2026 SCE: Semantic Consistency Enhanced Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning EMNLP 2025 SEAGraph: Unveiling the Whole Story of Paper Review Comments AACL 2025 Enhancing LLM-based Hatred and Toxicity Detection with Meta-Toxic Knowledge Graph ACL 2025 GEMS: Generation-Based Event Argument Extraction via Multi-perspective Prompts and Ontology Steering ACL 2025 WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning EMNLP 2025 Can Large Language Models Act as Ensembler for Multi-GNNs? EMNLP 2025 Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models EMNLP 2025 AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents ICLR 2025 D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning IJCAI 2025 SEAGraph: Unveiling the Whole Story of Paper Review Comments IJCNLP 2025 EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data CORL 2024 patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds AAAI 2024 TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models ICLR 2024 Learning the Target Network in Function Space ICML 2024 InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment ACL 2024 Cognitive Bias in Decision-Making with LLMs EMNLP 2024 MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation EMNLP 2023 Budgeting Counterfactual for Offline RL NIPS 2023 TD Convergence: An Optimization Perspective NIPS 2023 Generalized Federated Learning via Sharpness Aware Minimization ICML 2022 Offline policy optimization with eligible actions UAI 2022 Provably sample-efficient RL with side information about latent dynamics NIPS 2022 Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing AAAI 2021 Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions ICML 2020 SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation AAAI 2020 Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration NIPS 2020 Comb Decoding towards Collision-Free WiFi NSDI 2020 Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling ICML 2020 Combining parametric and nonparametric models for off-policy evaluation ICML 2019 Off-Policy Policy Gradient with Stationary Distribution Correction UAI 2019 A Scalable Heterogeneous Parallel SOM Based on MPI/CUDA ACML 2018 Representation Balancing MDPs for Off-policy Policy Evaluation NIPS 2018 A Decision Procedure for a Fragment of Linear Time Mu-Calculus IJCAI 2016