Yangchen Pan

20 papers · 2016–2026 · 9 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (9)

🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (9) 🤝 Dynamic Duo (12) 🏆 Grand Slam 🧬 Topic Evolution 💎 Century Club (19) 🗃️ Keyword Collector (56) 🔥 Unstoppable (10) 📈 Trend Setter

Conferences

ICLR (5) ICML (4) IJCAI (3) NIPS (2) UAI (2) AAAI (1) AISTATS (1) ECCV (1) JMLR (1)

Top co-authors

Martha White (12) Amir-massoud Farahmand (7) Chenjun Xiao (3) Adam White (3) Pascal Poupart (2) Philip Torr (2) Ehsan Imani (2) Jun Luo (2) Hengshuai Yao (2) Avery Ma (2)

Keywords

reinforcement learning (4) sample efficiency (3) model-based reinforcement learning (2) policy gradient (2) state abstraction (1) online learning (1) continual learning (1) continuous state (1) function approximation (1) domain adaptation (1) supervised learning (1) convergence analysis (1) autonomous driving (1) natural gradient (1) value function (1) optimal control (1) policy optimization (1) distribution shift (1) temporal difference learning (1) multi-agent reinforcement learning (1)

Papers

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models (Abstract Reprint) AAAI 2026 PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling ICML 2025 Improving Adversarial Transferability via Model Alignment ECCV 2024 Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination ICML 2024 Label Alignment Regularization for Distribution Shift JMLR 2024 The In-Sample Softmax for Offline Reinforcement Learning ICLR 2023 Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement ICLR 2023 An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient NIPS 2023 Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning UAI 2023 Understanding and mitigating the limitations of prioritized experience replay UAI 2022 An Alternate Policy Gradient Estimator for Softmax Policies AISTATS 2022 Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online ICLR 2021 An implicit function learning approach for parametric modal regression NIPS 2020 Frequency-based Search-control in Dyna ICLR 2020 Maxmin Q-learning: Controlling the Estimation Bias of Q-learning ICLR 2020 Hill Climbing on Value Estimates for Search-control in Dyna IJCAI 2019 Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains IJCAI 2018 Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control ICML 2018 Adapting Kernel Representations Online Using Submodular Maximization ICML 2017 Incremental Truncated LSTD IJCAI 2016