Shentao Yang
5 papers · 2022–2024 · 3 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (3)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Cross-Pollinator (14)
Trend Setter
Conferences
ICML (2)
NIPS (2)
ICLR (1)
Keywords
model-based reinforcement learning (2)
offline reinforcement learning (2)
preference learning (1)
text generation (1)
policy learning (1)
continuous control (1)
stationary distribution (1)
lower bound (1)
language model (1)
dynamics model (1)
language model fine-tuning (1)
dynamic model (1)
expected return (1)
distribution mismatch (1)
token-level guidance (1)
model-based policy (1)
offline model-based reinforcement learning (1)
model-policy learning (1)
sequence-level preference (1)
policy optimization (1)
Papers
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
ICML 2024
Preference-grounded Token-level Guidance for Language Model Fine-tuning
NIPS 2023
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
ICLR 2023
A Unified Framework for Alternating Offline Model Training and Policy Learning
NIPS 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
ICML 2022