Yaqi Duan
12 papers · 2019–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (6) π Cross-Pollinator (8)
π
Cross-Pollinator
(8)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(24)
π
Keyword Champion
(3)
ποΈ
Keyword Collector
(66)
π
Century Club
(12)
π
Trend Setter
β
The Questioner
π₯
Unstoppable
(7)
Conferences
ICML (5)
NIPS (3)
ICLR (1)
IJCAI (1)
JMLR (1)
L4DC (1)
Top co-authors
Keywords
off-policy evaluation
(4)
markov process
(3)
reinforcement learning
(2)
linear function approximation
(2)
batch reinforcement learning
(2)
fitted q-iteration
(2)
markov decision process
(2)
fitted q-evaluation
(2)
markov chain
(2)
statistical inference
(1)
offline reinforcement learning
(1)
state abstraction
(1)
3d reconstruction
(1)
online learning
(1)
unsupervised learning
(1)
policy evaluation
(1)
representation learning
(1)
tensor decomposition
(1)
convergence analysis
(1)
temporal difference learning
(1)
Papers
PILAF: Optimal Human Preference Sampling for Reward Modeling
ICML 2025
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
NIPS 2024
Invertible Residual Neural Networks with Conditional Injector and Interpolator for Point Cloud Upsampling
IJCAI 2023
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition
JMLR 2023
A finite-sample analysis of multi-step temporal difference estimates
L4DC 2023
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
ICLR 2022
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient
ICML 2021
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
ICML 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
ICML 2021
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation
ICML 2020
State Aggregation Learning from Markov Transition Data
NIPS 2019
Learning low-dimensional state embeddings and metastable clusters from time series data
NIPS 2019