Chao Yu
37 papers · 2019–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (9) π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(37)
π
Grand Slam
π¬
Deep Specialist
(11)
π§¬
Topic Evolution
π
Triple Crown
π€
Dynamic Duo
(12)
π
Keyword Champion
(2)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(103)
π₯
Unstoppable
(5)
β
The Questioner
π
Century Club
(34)
Conferences
AAAI (10)
ICML (7)
NIPS (6)
ICLR (4)
IJCAI (4)
CORL (2)
ACL (1)
ECCV (1)
EMNLP (1)
JMLR (1)
Top co-authors
Keywords
multi-agent reinforcement learning
(9)
offline reinforcement learning
(4)
model-based reinforcement learning
(3)
deep reinforcement learning
(3)
policy optimization
(3)
nash equilibrium
(3)
proximal policy optimization
(2)
policy gradient
(2)
reinforcement learning
(2)
game theory
(2)
multi-step prediction
(2)
model rollout
(2)
trajectory generation
(2)
multi-agent coordination
(2)
multi-agent system
(2)
preference learning
(1)
uncertainty quantification
(1)
curriculum learning
(1)
hierarchical reinforcement learning
(1)
transfer learning
(1)
Papers
Reliability-Guaranteed and Reward-Seeking Sequence Modeling for Model-Based Offline Reinforcement Learning
AAAI 2026
Red Teaming Large Reasoning Models
ACL 2026
CATAL: Causally Disentangled Task Representation Learning for Offline Meta-Reinforcement Learning
AAAI 2026
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
EMNLP 2025
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
ICLR 2025
Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams
CORL 2025
Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play
JMLR 2025
Rapid Learning in Constrained Minimax Games with Negative Momentum
AAAI 2025
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
AAAI 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
ICML 2025
Conservative Offline Goal-Conditioned Implicit V-Learning
ICML 2025
Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
CORL 2025
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
ICML 2025
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
ICML 2024
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
NIPS 2024
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
ICML 2024
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
AAAI 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
ICLR 2024
Causal Deep Reinforcement Learning Using Observational Data
IJCAI 2023
Hybrid Policy Optimization from Imperfect Demonstrations
NIPS 2023
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning
AAAI 2023
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks
AAAI 2023
Hierarchical Mean-Field Deep Reinforcement Learning for Large-Scale Multiagent Systems
AAAI 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
ICLR 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
ICML 2023
Automatic Truss Design with Reinforcement Learning
IJCAI 2023
Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning
NIPS 2022
Learning Efficient Multi-agent Cooperative Visual Exploration
ECCV 2022
Creativity of AI: Automatic Symbolic Option Discovery for Facilitating Deep Reinforcement Learning
AAAI 2022
A Unified Diversity Measure for Multiagent Reinforcement Learning
NIPS 2022
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
NIPS 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
ICML 2022
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
ICLR 2021
A Joint Training Dual-MRC Framework for Aspect Based Sentiment Analysis
AAAI 2021
Coordinated Proximal Policy Optimization
NIPS 2021
The Price of Governance: A Middle Ground Solution to Coordination in Organizational Control
IJCAI 2019
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework
IJCAI 2019