Yihan Du
15 papers · 2019–2025 · 5 top CS/AI conferences
Achievements
Academic Marathon (6)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (5)
Cross-Pollinator (8)
Renaissance Researcher (6)
Taxonomy Completionist (22)
Grand Slam
Triple Crown
Century Club (15)
Unstoppable (7)
Conferences
ICML (5)
ICLR (4)
NIPS (3)
AAAI (2)
CVPR (1)
Keywords
multi-armed bandit (4)
sample complexity (4)
regret bound (3)
combinatorial pure exploration (3)
combinatorial optimization (2)
graph matching (1)
function approximation (1)
active learning (1)
regret minimization (1)
online decision making (1)
bellman equation (1)
reward function (1)
covariance modeling (1)
bandit feedback (1)
object recognition (1)
safe reinforcement learning (1)
best arm identification (1)
risk-aware learning (1)
online algorithm (1)
online learning (1)
Papers
Reinforcement Learning with Segment Feedback
ICML 2025
Cascading Reinforcement Learning
ICLR 2024
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
ICLR 2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
ICML 2024
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
ICLR 2023
Provably Safe Reinforcement Learning with Step-wise Violation Constraints
NIPS 2023
Collaborative Pure Exploration in Kernel Bandit
ICLR 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
ICML 2023
Branching Reinforcement Learning
ICML 2022
Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback
AAAI 2021
Combinatorial Pure Exploration with Bottleneck Reward Function
NIPS 2021
A One-Size-Fits-All Solution to Conservative Bandit Problems
AAAI 2021
Continuous Mean-Covariance Bandits
NIPS 2021
Combinatorial Pure Exploration for Dueling Bandit
ICML 2020
Direct Object Recognition Without Line-Of-Sight Using Optical Coherence
CVPR 2019