Pierre-Luc Bacon

23 papers · 2018–2025 · 5 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (5) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (27) 👑 Triple Crown 🏆 Grand Slam 🧬 Topic Evolution ❓ The Questioner (2) 💎 Century Club (23) 🗃️ Keyword Collector (81) ⚡ Prolific Year (5) 🔥 Unstoppable (6)

Conferences

ICLR (7) ICML (7) NIPS (6) AAAI (2) AISTATS (1)

Top co-authors

Clement Gehring (3) Evgenii Nikishin (3) Pierluca D'Oro (3) Martin Klissarov (3) Pascal Vincent (3) Doina Precup (3) Michel Ma (3) Tianwei Ni (3) David Kanaa (2) Max Schwarzer (2)

Keywords

reinforcement learning (5) continuous control (2) policy optimization (2) policy gradient (2) sample efficiency (1) off-policy evaluation (1) sequence modeling (1) importance sampling (1) maximum entropy (1) markov decision process (1) function approximation (1) attention mechanism (1) constrained reinforcement learning (1) language modeling (1) hierarchical reinforcement learning (1) markov chain monte carlo (1) imitation learning (1) maximum likelihood (1) contrastive learning (1) transformer architecture (1)

Papers

MaestroMotif: Skill Design from Artificial Intelligence Feedback ICLR 2025 Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning ICML 2025 Scaling Trends in Language Model Robustness ICML 2025 Motif: Intrinsic Motivation from Artificial Intelligence Feedback ICLR 2024 Maximum entropy GFlowNets with soft Q-learning AISTATS 2024 Course Correcting Koopman Representations ICLR 2024 Decoupling regularization from the action space ICLR 2024 Bridging State and History Representations: Understanding Self-Predictive RL ICLR 2024 Do Transformer World Models Give Better Policy Gradients? ICML 2024 Double Gumbel Q-Learning NIPS 2023 Block-State Transformers NIPS 2023 Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control NIPS 2023 Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier ICLR 2023 When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment NIPS 2023 Continuous-Time Meta-Learning with Forward Mode Differentiation ICLR 2022 Direct Behavior Specification via Constrained Reinforcement Learning ICML 2022 Myriad: a real-world testbed to bridge trajectory optimization and deep learning NIPS 2022 Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation AAAI 2022 The Primacy Bias in Deep Reinforcement Learning ICML 2022 Neural Algorithmic Reasoners are Implicit Planners NIPS 2021 Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling ICML 2020 Convergent Tree Backup and Retrace with Function Approximation ICML 2018