Wenhao Zhan
12 papers · 2022–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (11) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π Conference Polyglot (3)
π
Cross-Pollinator
(13)
π
Century Club
(12)
β‘
Prolific Year
(5)
Conferences
ICLR (8)
COLT (2)
NIPS (2)
Top co-authors
Keywords
sample complexity
(3)
offline reinforcement learning
(2)
image generation
(1)
policy optimization
(1)
preference learning
(1)
natural policy gradient
(1)
language modeling
(1)
adaptive sampling
(1)
vc dimension
(1)
primal-dual algorithm
(1)
generative model
(1)
language model
(1)
worst-case risk
(1)
relative reward
(1)
preference-based learning
(1)
multi-distribution learning
(1)
density ratio
(1)
policy fine-tuning
(1)
hybrid reinforcement learning
(1)
reinforcement learning
(1)
Papers
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
ICLR 2025
Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization
ICLR 2025
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025
Provable Offline Preference-Based Reinforcement Learning
ICLR 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NIPS 2024
Optimal Multi-Distribution Learning
COLT 2024
Provable Reward-Agnostic Preference-Based Reinforcement Learning
ICLR 2024
Provably Efficient CVaR RL in Low-rank MDPs
ICLR 2024
Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games
ICLR 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
NIPS 2023
PAC Reinforcement Learning for Predictive State Representations
ICLR 2023
Offline Reinforcement Learning with Realizability and Single-policy Concentrability
COLT 2022