conftrace_

Olivier Pietquin

58 papers · 2011–2025 · 13 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (20) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13) 🌈 Renaissance Researcher (7) 🤝 Dynamic Duo (35) 👑 Triple Crown 🧬 Topic Evolution 🏆 Grand Slam 🔬 Deep Specialist (23) 🏆 Keyword Champion (2) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (12) ⚡ Prolific Year (11) 🗃️ Keyword Collector (57) 💎 Century Club (58) ❓ The Questioner (5)

Conferences

ICML (12) NIPS (12) ICLR (7) AAAI (5) AISTATS (5) IJCAI (5) ACL (3) EMNLP (2) INTERSPEECH (2) NAACL (2) ACML (1) CVPR (1) IJCNLP (1)

Top co-authors

Matthieu Geist (35) Florian Strub (13) Julien Pérolat (11) Bilal Piot (11) Leonard Hussenot (9) Nino Vieillard (9) Mathieu Lauriere (7) Sertan Girgin (7) Robert Dadashi (7) Olivier Bachem (6)

Keywords

reinforcement learning (18) deep reinforcement learning (7) fictitious play (7) multi-agent system (7) mean field game (6) policy learning (6) nash equilibrium (6) policy iteration (5) value iteration (5) imitation learning (5) markov game (4) game theory (4) policy optimization (4) markov decision process (4) reward function (4) entropy regularization (3) continuous control (3) off-policy learning (3) approximate dynamic programming (3) sample complexity (3)

Papers

NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics · ICLR 2025
Self-Improving Robust Preference Optimization · ICLR 2025
Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs · ACL 2024
Learning Discrete-Time Major-Minor Mean Field Games · AAAI 2024
MusicRL: Aligning Music Generation to Human Preferences · ICML 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning · ACL 2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion · EMNLP 2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice · ICML 2023
On Imitation in Mean-field Games · NIPS 2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback · ACL 2023
Generalization in Mean Field Games by Learning Master Policies · AAAI 2022
Learning Natural Language Generation with Truncated Reinforcement Learning · NAACL 2022
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games · ICML 2022
Continuous Control with Action Quantization from Demonstrations · ICML 2022
Implicitly Regularized RL with Implicit Q-values · AISTATS 2022
On the role of population heterogeneity in emergent communication · ICLR 2022
Offline Reinforcement Learning as Anti-exploration · AAAI 2022
Emergent Communication: Generalization and Overfitting in Lewis Games · NIPS 2022
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning · NIPS 2021
What Matters for Adversarial Imitation Learning? · NIPS 2021
Mean Field Games Flock! The Reinforcement Learning Way · IJCAI 2021
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study · ICLR 2021
Offline Reinforcement Learning with Pseudometric Learning · ICML 2021
Primal Wasserstein Imitation Learning · ICLR 2021
Adversarially Guided Actor-Critic · ICLR 2021
Hyperparameter Selection for Imitation Learning · ICML 2021
Don’t Do What Doesn’t Matter: Intrinsic Motivation with Action Usefulness · IJCAI 2021
Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications · NIPS 2020
Munchausen Reinforcement Learning · NIPS 2020
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning · NIPS 2020
Deep Conservative Policy Iteration · AAAI 2020
On the Convergence of Model Free Learning in Mean Field Games · AAAI 2020
Foolproof Cooperative Learning · ACML 2020
Momentum in Reinforcement Learning · AISTATS 2020
Supervised Seeded Iterated Learning for Interactive Language Learning · EMNLP 2020
Countering Language Drift with Seeded Iterated Learning · ICML 2020
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning · IJCAI 2020
A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning · INTERSPEECH 2020
Budgeted Reinforcement Learning in Continuous State Space · NIPS 2019
A Theory of Regularized Markov Decision Processes · ICML 2019
Learning from a Learner · ICML 2019
Actor-Critic Fictitious Play in Simultaneous Move Multistage Games · AISTATS 2018
Noisy Networks For Exploration · ICLR 2018
Is the Bellman residual a bad proxy? · NIPS 2017
Learning Nash Equilibrium for General-Sum Markov Games from Batch Data · AISTATS 2017
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue · CVPR 2017
Modulating early visual processing by language · NIPS 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems · IJCAI 2017
Softened Approximate Policy Iteration for Markov Games · ICML 2016
On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games · AISTATS 2016
A Stochastic Model for Computer-Aided Human-Human Dialogue · INTERSPEECH 2016
PAC learning of Probabilistic Automaton based on the Method of Moments · ICML 2016
Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games · ICML 2015
Inverse Reinforcement Learning in Relational Domains · IJCAI 2015
Difference of Convex Functions Programming for Reinforcement Learning · NIPS 2014
Inverse Reinforcement Learning through Structured Classification · NIPS 2012
Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future? · NAACL 2012
Training a BN-based user model for dialogue simulation with missing data · IJCNLP 2011