conftrace_

Runlong Zhou

6 papers · 2021–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

🌍 Conference Polyglot (4) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)

Conferences

ICLR (2) ICML (2) ACL (1) NIPS (1)

Top co-authors

Simon Shaolei Du (4) Zhang Zihan (1) Abhishek Gupta (1) Michal Valko (1) Alessandro Lazaric (1) Zhaoyi Zhou (1) Beibin Li (1) Ruizhe Shi (1) Qiwen Cui (1) Matteo Pirotta (1)

Keywords

regret bound (2) curriculum learning (1) policy learning (1) markov decision process (1) value iteration (1) model-based reinforcement learning (1) online reinforcement learning (1) stochastic shortest path (1) minimax optimal (1) language model (1) exploration bonus (1) model-based algorithm (1) variance analysis (1) stochastic environment (1) latent markov decision process (1) variance-dependent bound (1) deterministic environment (1) reinforcement learning (1) episode-based learning (1)

Papers

The Crucial Role of Samplers in Online Direct Preference Optimization ICLR 2025 Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ACL 2024 Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning ICLR 2024 Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes ICML 2023 Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments ICML 2023 Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret NIPS 2021