Yao Liu
37 papers · 2016–2026 · 13 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist
(14)
Keyword Pioneer
Renaissance Researcher
(5)
Interdisciplinary Bridge
Hot Topic Early Bird
Conference Polyglot
(13)
Mega-Team
(34)
Grand Slam
Topic Evolution
Keyword Collector
(168)
Prolific Year
(5)
Conference Pioneer
Century Club
(33)
Unstoppable
(8)
Trend Setter
The Questioner
Conferences
AAAI (6)
EMNLP (6)
ICML (5)
NIPS (5)
ACL (4)
ICLR (2)
IJCAI (2)
UAI (2)
AACL (1)
ACML (1)
CORL (1)
IJCNLP (1)
NSDI (1)
Keywords
large language model
(6)
reinforcement learning
(5)
importance sampling
(4)
graph retrieval
(3)
markov decision process
(3)
off-policy evaluation
(3)
knowledge distillation
(3)
batch reinforcement learning
(3)
knowledge graph
(3)
instruction tuning
(3)
review comment understanding
(2)
peer review
(2)
multi-hop reasoning
(2)
semantic mind graph
(2)
policy optimization
(2)
hierarchical background graph
(2)
few-shot learning
(1)
iterative optimization
(1)
semi-supervised learning
(1)
sparse recovery
(1)
Papers
Exploiting Inter-Session Information with Frequency-enhanced Dual-Path Networks for Sequential Recommendation
AAAI 2026
Why Do Emotions Change? Appraisal-Guided Reasoning for Emotion–Cause Triplet Extraction in Conversations
ACL 2026
Human Cognition Inspired RAG with Knowledge Graph for Complex Problem Solving
AAAI 2026
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
AAAI 2026
SCE: Semantic Consistency Enhanced Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning
EMNLP 2025
SEAGraph: Unveiling the Whole Story of Paper Review Comments
AACL 2025
Enhancing LLM-based Hatred and Toxicity Detection with Meta-Toxic Knowledge Graph
ACL 2025
GEMS: Generation-Based Event Argument Extraction via Multi-perspective Prompts and Ontology Steering
ACL 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
EMNLP 2025
Can Large Language Models Act as Ensembler for Multi-GNNs?
EMNLP 2025
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
EMNLP 2025
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
ICLR 2025
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
IJCAI 2025
SEAGraph: Unveiling the Whole Story of Paper Review Comments
IJCNLP 2025
EXTRACT: Efficient Policy Learning by Extracting Transferable Robot Skills from Offline Data
CORL 2024
patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds
AAAI 2024
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models
ICLR 2024
Learning the Target Network in Function Space
ICML 2024
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
ACL 2024
Cognitive Bias in Decision-Making with LLMs
EMNLP 2024
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
EMNLP 2023
Budgeting Counterfactual for Offline RL
NIPS 2023
TD Convergence: An Optimization Perspective
NIPS 2023
Generalized Federated Learning via Sharpness Aware Minimization
ICML 2022
Offline policy optimization with eligible actions
UAI 2022
Provably sample-efficient RL with side information about latent dynamics
NIPS 2022
Asynchronous Teacher Guided Bit-wise Hard Mining for Online Hashing
AAAI 2021
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions
ICML 2020
SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation
AAAI 2020
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration
NIPS 2020
Comb Decoding towards Collision-Free WiFi
NSDI 2020
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
ICML 2020
Combining parametric and nonparametric models for off-policy evaluation
ICML 2019
Off-Policy Policy Gradient with Stationary Distribution Correction
UAI 2019
A Scalable Heterogeneous Parallel SOM Based on MPI/CUDA
ACML 2018
Representation Balancing MDPs for Off-policy Policy Evaluation
NIPS 2018
A Decision Procedure for a Fragment of Linear Time Mu-Calculus
IJCAI 2016