Yuda Song
17 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Conference Polyglot (7)
Academic Marathon (5)
Renaissance Researcher (5)
Taxonomy Completionist (15)
Triple Crown
Century Club (17)
Unstoppable (6)
Conferences
ICML (7)
ICLR (5)
COLT (1)
ECCV (1)
ICCV (1)
L4DC (1)
NeurIPS (1)
Research topics
Keywords
model-based reinforcement learning (4)
online learning (2)
policy optimization (2)
representation learning (2)
sample complexity (1)
model predictive control (1)
computational efficiency (1)
domain randomization (1)
value function (1)
real-time processing (1)
offline learning (1)
target task (1)
online reinforcement learning (1)
adaptive control (1)
regret bound (1)
policy adaptation (1)
model-free reinforcement learning (1)
latent state (1)
block MDP (1)
source task (1)
Papers
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
ICLR 2025
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
ICML 2025
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage
NeurIPS 2024
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
ICLR 2024
Hybrid Reinforcement Learning from Offline Observation Alone
ICML 2024
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics
ICML 2024
Representation Learning for Low-rank General-sum Markov Games
ICLR 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
ICML 2023
Provable Benefits of Representational Transfer in Reinforcement Learning
COLT 2023
Hybrid RL: Using both offline and online data can make RL efficient
ICLR 2023
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
ICLR 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach
ICML 2022
Online No-regret Model-Based Meta RL for Personalized Navigation
L4DC 2022
Multi-Curve Translator for High-Resolution Photorealistic Image Translation
ECCV 2022
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement
ICCV 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
ICML 2021
Provably Efficient Model-based Policy Adaptation
ICML 2020