Kelvin Xu
12 papers · 2015–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10)
🐝
Cross-Pollinator
(7)
🌈
Renaissance Researcher
(6)
🌍
Conference Polyglot
(4)
💎
Century Club
(12)
📈
Trend Setter
🚀
Conference Pioneer
Conferences
ICLR (5)
ICML (3)
NIPS (3)
RSS (1)
Top co-authors
Keywords
reinforcement learning
(3)
reward function
(2)
continual learning
(1)
few-shot learning
(1)
imitation learning
(1)
policy optimization
(1)
robotic manipulation
(1)
variational inference
(1)
game theory
(1)
bayesian inference
(1)
policy gradient
(1)
image captioning
(1)
visual representation
(1)
inverse reinforcement learning
(1)
value function
(1)
skill discovery
(1)
visual attention
(1)
deep model
(1)
probabilistic model
(1)
entropy regularization
(1)
Papers
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
ICML 2025
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning
ICLR 2025
Small-scale proxies for large-scale Transformer training instabilities
ICLR 2024
Autonomous Reinforcement Learning: Formalism and Benchmarking
ICLR 2022
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples
ICLR 2020
Continual Learning of Control Primitives : Skill Discovery via Reset-Games
NIPS 2020
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
ICML 2019
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
ICLR 2018
Probabilistic Model-Agnostic Meta-Learning
NIPS 2018
Unsupervised Perceptual Rewards for Imitation Learning
RSS 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
NIPS 2017
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
ICML 2015