Nino Vieillard

15 papers · 2020–2025 · 6 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌍 Conference Polyglot (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (5)

🐝 Cross-Pollinator (13) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 👑 Triple Crown 👥 Mega-Team (20) 🤝 Dynamic Duo (11) 🏆 Grand Slam 💎 Century Club (15) 🔥 Unstoppable (6)

Conferences

ICML (5) NIPS (3) AAAI (2) AISTATS (2) ICLR (2) ACL (1)

Top co-authors

Matthieu Geist (11) Olivier Pietquin (9) Olivier Bachem (6) Leonard Hussenot (5) Johan Ferret (4) Nikola Momchev (3) Geoffrey Cideron (3) Alexandre Rame (3) Piotr Stańczyk (3) Robert Dadashi (3)

Keywords

reinforcement learning (5) value iteration (3) off-policy learning (2) offline reinforcement learning (2) temporal difference (2) policy regularization (2) deep reinforcement learning (2) function approximation (1) imitation learning (1) sequence modeling (1) inverse reinforcement learning (1) reinforcement learning from human feedback (1) policy learning (1) kullback-leibler divergence (1) minimax optimality (1) kl regularization (1) reward function (1) approximate dynamic programming (1) policy iteration (1) sample complexity (1)

Papers

Loss Functions and Operators Generated by f-Divergences ICML 2025 On Teacher Hacking in Language Model Distillation ICML 2025 BOND: Aligning LLMs with Best-of-N Distillation ICLR 2025 On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes ICLR 2024 Imitating Language via Scalable Inverse Reinforcement Learning NIPS 2024 WARM: On the Benefits of Weight Averaged Reward Models ICML 2024 Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice ICML 2023 Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback ACL 2023 Implicitly Regularized RL with Implicit Q-values AISTATS 2022 Offline Reinforcement Learning as Anti-exploration AAAI 2022 Offline Reinforcement Learning with Pseudometric Learning ICML 2021 Munchausen Reinforcement Learning NIPS 2020 Momentum in Reinforcement Learning AISTATS 2020 Deep Conservative Policy Iteration AAAI 2020 Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning NIPS 2020