Yangchen Pan
20 papers · 2016–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (9)
🌈
Renaissance Researcher
(5)
🌍
Conference Polyglot
(8)
🏃
Academic Marathon
(9)
🤝
Dynamic Duo
(12)
🏆
Grand Slam
🧬
Topic Evolution
💎
Century Club
(19)
🗃️
Keyword Collector
(56)
🔥
Unstoppable
(10)
📈
Trend Setter
Conferences
ICLR (5)
ICML (4)
IJCAI (3)
NIPS (2)
UAI (2)
AAAI (1)
AISTATS (1)
ECCV (1)
JMLR (1)
Top co-authors
Keywords
reinforcement learning
(4)
sample efficiency
(3)
model-based reinforcement learning
(2)
policy gradient
(2)
state abstraction
(1)
online learning
(1)
continual learning
(1)
continuous state
(1)
function approximation
(1)
domain adaptation
(1)
supervised learning
(1)
convergence analysis
(1)
autonomous driving
(1)
natural gradient
(1)
value function
(1)
optimal control
(1)
policy optimization
(1)
distribution shift
(1)
temporal difference learning
(1)
multi-agent reinforcement learning
(1)
Papers
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models (Abstract Reprint)
AAAI 2026
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
ICML 2025
Improving Adversarial Transferability via Model Alignment
ECCV 2024
Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
ICML 2024
Label Alignment Regularization for Distribution Shift
JMLR 2024
The In-Sample Softmax for Offline Reinforcement Learning
ICLR 2023
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
ICLR 2023
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
NIPS 2023
Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning
UAI 2023
Understanding and mitigating the limitations of prioritized experience replay
UAI 2022
An Alternate Policy Gradient Estimator for Softmax Policies
AISTATS 2022
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
ICLR 2021
An implicit function learning approach for parametric modal regression
NIPS 2020
Frequency-based Search-control in Dyna
ICLR 2020
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
ICLR 2020
Hill Climbing on Value Estimates for Search-control in Dyna
IJCAI 2019
Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains
IJCAI 2018
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
ICML 2018
Adapting Kernel Representations Online Using Submodular Maximization
ICML 2017
Incremental Truncated LSTD
IJCAI 2016