Yali Du
47 papers · 2017–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (11) π Conference Polyglot (7)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Cross-Pollinator
(14)
π€
Dynamic Duo
(14)
π
Triple Crown
π
Grand Slam
π¬
Deep Specialist
(11)
π§¬
Topic Evolution
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(186)
π₯
Unstoppable
(7)
π
Century Club
(45)
β‘
Prolific Year
(11)
Conferences
NIPS (14)
AAAI (8)
ICML (7)
IJCAI (7)
EMNLP (5)
ICLR (4)
ACL (2)
Top co-authors
Keywords
reinforcement learning
(10)
multi-agent reinforcement learning
(8)
multi-agent system
(7)
large language model
(7)
text-based game
(4)
credit assignment
(4)
game theory
(3)
game ai
(3)
policy optimization
(3)
policy learning
(3)
cooperative game
(3)
language model
(3)
sparse reward
(2)
cooperative multi-agent
(2)
preference-based reinforcement learning
(2)
deep reinforcement learning
(2)
knowledge graph
(2)
sample efficiency
(2)
reward learning
(2)
causal inference
(2)
Papers
Safe Multi-agent Reinforcement Learning with Natural Language Constraints
AAAI 2026
Causality-Aware Efficient Exploration for Cooperative Multi-Agent Reinforcement Learning
AAAI 2026
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning
EMNLP 2025
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models
ICLR 2025
RuAG: Learned-rule-augmented Generation for Large Language Models
ICLR 2025
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
EMNLP 2025
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
AAAI 2025
ATLAS: Agent Tuning via Learning Critical Steps
ACL 2025
Spiral of Silence in Large Language Model Agents
EMNLP 2025
Quantifying the Self-Interest Level of Markov Social Dilemmas
IJCAI 2025
GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs
ICML 2025
M$^3$HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
ICML 2025
VLP: Vision-Language Preference Learning for Embodied Manipulation
EMNLP 2025
Human-Guided Moral Decision Making in Text-Based Games
AAAI 2024
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting
NIPS 2024
Learning the Expected Core of Strictly Convex Stochastic Cooperative Games
NIPS 2024
Aligning Individual and Collective Objectives in Multi-Agent Cooperation
NIPS 2024
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
NIPS 2024
Self-Guiding Exploration for Combinatorial Problems
NIPS 2024
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning
AAAI 2024
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
AAAI 2024
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation
ICML 2024
Dual Contrastive Graph-Level Clustering with Multiple Cluster Perspectives Alignment
IJCAI 2024
Off-Agent Trust Region Policy Optimization
IJCAI 2024
ChessGPT: Bridging Policy Learning and Language Modeling
NIPS 2023
Cooperative Open-ended Learning Framework for Zero-Shot Coordination
ICML 2023
Cooperative Multi-Agent Learning in a Complex World: Challenges and Solutions
AAAI 2023
Invariant Learning via Probability of Sufficient and Necessary Causes
NIPS 2023
Stay Moral and Explore: Learn to Behave Morally in Text-based Games
ICLR 2023
Reduced Policy Optimization for Continuous Control with Hard Constraints
NIPS 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
NIPS 2023
Capturing the Long-Distance Dependency in the Control Flow Graph via Structural-Guided Attention for Bug Localization
IJCAI 2023
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination
NIPS 2023
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
NIPS 2022
Learning to Identify Top Elo Ratings: A Dueling Bandits Approach
AAAI 2022
Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL
ICLR 2022
Perceiving the World: Question-guided Reinforcement Learning for Text-based Games
ACL 2022
Estimating $Ξ±$-Rank from A Few Entries with Low Rank Matrix Completion
ICML 2021
Ordering-Based Causal Discovery with Reinforcement Learning
IJCAI 2021
Generalization in Text-based Games via Hierarchical Reinforcement Learning
EMNLP 2021
Learning in Nonzero-Sum Stochastic Games with Potentials
ICML 2021
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games
NIPS 2020
Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI
ICML 2019
Curriculum-guided Hindsight Experience Replay
NIPS 2019
LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
NIPS 2019
Privileged Matrix Factorization for Collaborative Filtering
IJCAI 2017
Collaborative Rating Allocation
IJCAI 2017