conftrace_

Kelvin Xu

12 papers · 2015–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10)

🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (4) 💎 Century Club (12) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

ICLR (5) ICML (3) NIPS (3) RSS (1)

Top co-authors

Sergey Levine (6) Chelsea Finn (4) Mohammad Norouzi (2) Charlie Victor Snell (2) Jaehoon Lee (2) Ofir Nachum (2) Dale Schuurmans (2) Abhishek Gupta (1) Yuexiang Zhai (1) Katie E Everett (1)

Keywords

reinforcement learning (3) reward function (2) continual learning (1) few-shot learning (1) imitation learning (1) policy optimization (1) robotic manipulation (1) variational inference (1) game theory (1) bayesian inference (1) policy gradient (1) image captioning (1) visual representation (1) inverse reinforcement learning (1) value function (1) skill discovery (1) visual attention (1) deep model (1) probabilistic model (1) entropy regularization (1)

Papers

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models ICML 2025 Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning ICLR 2025 Small-scale proxies for large-scale Transformer training instabilities ICLR 2024 Autonomous Reinforcement Learning: Formalism and Benchmarking ICLR 2022 Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples ICLR 2020 Continual Learning of Control Primitives : Skill Discovery via Reset-Games NIPS 2020 Learning a Prior over Intent via Meta-Inverse Reinforcement Learning ICML 2019 Trust-PCL: An Off-Policy Trust Region Method for Continuous Control ICLR 2018 Probabilistic Model-Agnostic Meta-Learning NIPS 2018 Unsupervised Perceptual Rewards for Imitation Learning RSS 2017 Bridging the Gap Between Value and Policy Based Reinforcement Learning NIPS 2017 Show, Attend and Tell: Neural Image Caption Generation with Visual Attention ICML 2015