Tong Mu
4 papers · 2022–2024 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(15)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
NIPS (2)
AAAI (1)
ICML (1)
Top co-authors
Keywords
reinforcement learning
(3)
zero-shot learning
(1)
multi-task learning
(1)
online learning
(1)
domain adaptation
(1)
language grounding
(1)
distribution shift
(1)
language learning
(1)
offline learning
(1)
domain knowledge
(1)
distributionally robust optimization
(1)
preference modeling
(1)
upper confidence bound
(1)
contextual bandit
(1)
reward model
(1)
meta-reinforcement learning
(1)
embodied agent
(1)
policy constraint
(1)
multi-task generalization
(1)
large language model
(1)
Papers
Rule Based Rewards for Language Model Safety
NIPS 2024
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning
ICML 2023
Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits
NIPS 2022
Constraint Sampling Reinforcement Learning: Incorporating Expertise for Faster Learning
AAAI 2022