Olivier Pietquin
58 papers · 2011–2025 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (20) π Interdisciplinary Bridge π Conference Polyglot (13)
π
Interdisciplinary Bridge
π
Conference Polyglot
(13)
π
Renaissance Researcher
(7)
π€
Dynamic Duo
(35)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π¬
Deep Specialist
(23)
π
Keyword Champion
(2)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(12)
β‘
Prolific Year
(11)
ποΈ
Keyword Collector
(57)
π
Century Club
(58)
β
The Questioner
(5)
Conferences
ICML (12)
NIPS (12)
ICLR (7)
AAAI (5)
AISTATS (5)
IJCAI (5)
ACL (3)
EMNLP (2)
INTERSPEECH (2)
NAACL (2)
ACML (1)
CVPR (1)
IJCNLP (1)
Top co-authors
Keywords
reinforcement learning
(18)
deep reinforcement learning
(7)
fictitious play
(7)
multi-agent system
(7)
mean field game
(6)
policy learning
(6)
nash equilibrium
(6)
policy iteration
(5)
value iteration
(5)
imitation learning
(5)
markov game
(4)
game theory
(4)
policy optimization
(4)
markov decision process
(4)
reward function
(4)
entropy regularization
(3)
continuous control
(3)
off-policy learning
(3)
approximate dynamic programming
(3)
sample complexity
(3)
Papers
NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics
ICLR 2025
Self-Improving Robust Preference Optimization
ICLR 2025
Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs
ACL 2024
Learning Discrete-Time Major-Minor Mean Field Games
AAAI 2024
MusicRL: Aligning Music Generation to Human Preferences
ICML 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
ACL 2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP 2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
ICML 2023
On Imitation in Mean-field Games
NIPS 2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
ACL 2023
Generalization in Mean Field Games by Learning Master Policies
AAAI 2022
Learning Natural Language Generation with Truncated Reinforcement Learning
NAACL 2022
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
ICML 2022
Continuous Control with Action Quantization from Demonstrations
ICML 2022
Implicitly Regularized RL with Implicit Q-values
AISTATS 2022
On the role of population heterogeneity in emergent communication
ICLR 2022
Offline Reinforcement Learning as Anti-exploration
AAAI 2022
Emergent Communication: Generalization and Overfitting in Lewis Games
NIPS 2022
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
NIPS 2021
What Matters for Adversarial Imitation Learning?
NIPS 2021
Mean Field Games Flock! The Reinforcement Learning Way
IJCAI 2021
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study
ICLR 2021
Offline Reinforcement Learning with Pseudometric Learning
ICML 2021
Primal Wasserstein Imitation Learning
ICLR 2021
Adversarially Guided Actor-Critic
ICLR 2021
Hyperparameter Selection for Imitation Learning
ICML 2021
Donβt Do What Doesnβt Matter: Intrinsic Motivation with Action Usefulness
IJCAI 2021
Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications
NIPS 2020
Munchausen Reinforcement Learning
NIPS 2020
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning
NIPS 2020
Deep Conservative Policy Iteration
AAAI 2020
On the Convergence of Model Free Learning in Mean Field Games
AAAI 2020
Foolproof Cooperative Learning
ACML 2020
Momentum in Reinforcement Learning
AISTATS 2020
Supervised Seeded Iterated Learning for Interactive Language Learning
EMNLP 2020
Countering Language Drift with Seeded Iterated Learning
ICML 2020
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
IJCAI 2020
A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning
INTERSPEECH 2020
Budgeted Reinforcement Learning in Continuous State Space
NIPS 2019
A Theory of Regularized Markov Decision Processes
ICML 2019
Learning from a Learner
ICML 2019
Actor-Critic Fictitious Play in Simultaneous Move Multistage Games
AISTATS 2018
Noisy Networks For Exploration
ICLR 2018
Is the Bellman residual a bad proxy?
NIPS 2017
Learning Nash Equilibrium for General-Sum Markov Games from Batch Data
AISTATS 2017
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue
CVPR 2017
Modulating early visual processing by language
NIPS 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
IJCAI 2017
Softened Approximate Policy Iteration for Markov Games
ICML 2016
On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games
AISTATS 2016
A Stochastic Model for Computer-Aided Human-Human Dialogue
INTERSPEECH 2016
PAC learning of Probabilistic Automaton based on the Method of Moments
ICML 2016
Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
ICML 2015
Inverse Reinforcement Learning in Relational Domains
IJCAI 2015
Difference of Convex Functions Programming for Reinforcement Learning
NIPS 2014
Inverse Reinforcement Learning through Structured Classification
NIPS 2012
Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future?
NAACL 2012
Training a BN-based user model for dialogue simulation with missing data
IJCNLP 2011