Runlong Zhou
6 papers · 2021–2025 · 4 top CS/AI conferences
Achievements
🌍 Conference Polyglot (4)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (14)
Conferences
ICLR (2)
ICML (2)
ACL (1)
NeurIPS (1)
Keywords
regret bound (2)
curriculum learning (1)
policy learning (1)
markov decision process (1)
value iteration (1)
model-based reinforcement learning (1)
online reinforcement learning (1)
stochastic shortest path (1)
minimax optimal (1)
language model (1)
exploration bonus (1)
model-based algorithm (1)
variance analysis (1)
stochastic environment (1)
latent markov decision process (1)
variance-dependent bound (1)
deterministic environment (1)
reinforcement learning (1)
episode-based learning (1)
Papers
The Crucial Role of Samplers in Online Direct Preference Optimization
ICLR 2025
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
ACL 2024
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
ICLR 2024
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes
ICML 2023
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments
ICML 2023
Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret
NeurIPS 2021