Yaqi Duan

12 papers · 2019–2025 · 6 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🐝 Cross-Pollinator (8)

🐝 Cross-Pollinator (8) 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (24) 🏆 Keyword Champion (3) 🗃️ Keyword Collector (66) 💎 Century Club (12) 📈 Trend Setter ❓ The Questioner 🔥 Unstoppable (7)

Conferences

ICML (5) NIPS (3) ICLR (1) IJCAI (1) JMLR (1) L4DC (1)

Top co-authors

Mengdi Wang (7) Botao Hao (2) Csaba Szepesvári (2) Martin J. Wainwright (2) Munther Dahleh (1) Tracy Ke (1) Hao Lu (1) Chi Jin (1) Ming Yin (1) Aihua Mao (1)

Keywords

off-policy evaluation (4) markov process (3) reinforcement learning (2) linear function approximation (2) batch reinforcement learning (2) fitted q-iteration (2) markov decision process (2) fitted q-evaluation (2) markov chain (2) statistical inference (1) offline reinforcement learning (1) state abstraction (1) 3d reconstruction (1) online learning (1) unsupervised learning (1) policy evaluation (1) representation learning (1) tensor decomposition (1) convergence analysis (1) temporal difference learning (1)

Papers

PILAF: Optimal Human Preference Sampling for Reward Modeling ICML 2025 Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces NIPS 2024 Invertible Residual Neural Networks with Conditional Injector and Interpolator for Point Cloud Upsampling IJCAI 2023 Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition JMLR 2023 A finite-sample analysis of multi-step temporal difference estimates L4DC 2023 Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism ICLR 2022 Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient ICML 2021 Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning ICML 2021 Bootstrapping Fitted Q-Evaluation for Off-Policy Inference ICML 2021 Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation ICML 2020 State Aggregation Learning from Markov Transition Data NIPS 2019 Learning low-dimensional state embeddings and metastable clusters from time series data NIPS 2019