Nino Vieillard
15 papers · 2020–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Conference Polyglot (6) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird π Academic Marathon (5)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π
Triple Crown
π₯
Mega-Team
(20)
π€
Dynamic Duo
(11)
π
Grand Slam
π
Century Club
(15)
π₯
Unstoppable
(6)
Conferences
ICML (5)
NIPS (3)
AAAI (2)
AISTATS (2)
ICLR (2)
ACL (1)
Top co-authors
Keywords
reinforcement learning
(5)
value iteration
(3)
off-policy learning
(2)
offline reinforcement learning
(2)
temporal difference
(2)
policy regularization
(2)
deep reinforcement learning
(2)
function approximation
(1)
imitation learning
(1)
sequence modeling
(1)
inverse reinforcement learning
(1)
reinforcement learning from human feedback
(1)
policy learning
(1)
kullback-leibler divergence
(1)
minimax optimality
(1)
kl regularization
(1)
reward function
(1)
approximate dynamic programming
(1)
policy iteration
(1)
sample complexity
(1)
Papers
Loss Functions and Operators Generated by f-Divergences
ICML 2025
On Teacher Hacking in Language Model Distillation
ICML 2025
BOND: Aligning LLMs with Best-of-N Distillation
ICLR 2025
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
ICLR 2024
Imitating Language via Scalable Inverse Reinforcement Learning
NIPS 2024
WARM: On the Benefits of Weight Averaged Reward Models
ICML 2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
ICML 2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
ACL 2023
Implicitly Regularized RL with Implicit Q-values
AISTATS 2022
Offline Reinforcement Learning as Anti-exploration
AAAI 2022
Offline Reinforcement Learning with Pseudometric Learning
ICML 2021
Munchausen Reinforcement Learning
NIPS 2020
Momentum in Reinforcement Learning
AISTATS 2020
Deep Conservative Policy Iteration
AAAI 2020
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning
NIPS 2020