Pierre-yves Oudeyer
30 papers · 2012–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
๐บ๏ธ Taxonomy Completionist (16) ๐งญ Keyword Pioneer ๐ Renaissance Researcher (7) ๐ Interdisciplinary Bridge ๐ Conference Polyglot (9)
๐
Conference Polyglot
(9)
๐
Academic Marathon
(13)
๐
Cross-Pollinator
(8)
๐
Triple Crown
๐ค
Dynamic Duo
(11)
๐งฌ
Topic Evolution
๐ฑ
Topic Pioneer
๐ฌ
Deep Specialist
(12)
๐
Keyword Champion
(2)
๐
Trend Setter
๐๏ธ
Keyword Collector
(108)
๐ฅ
Unstoppable
(8)
๐
Century Club
(30)
โ
The Questioner
โก
Prolific Year
(6)
๐
Conference Pioneer
Conferences
ICLR (7)
ICML (7)
NIPS (7)
CORL (3)
NAACL (2)
ACL (1)
EMNLP (1)
IJCAI (1)
JMLR (1)
Top co-authors
Research topics
Keywords
intrinsic motivation
(6)
deep reinforcement learning
(6)
large language model
(4)
modular architecture
(3)
automatic curriculum learning
(3)
reinforcement learning
(3)
curriculum learning
(2)
sample efficiency
(2)
procedural generation
(2)
question generation
(2)
autonomous learning
(2)
sparse reward
(2)
code generation
(1)
text generation
(1)
sim-to-real transfer
(1)
prompt engineering
(1)
question answering
(1)
representation learning
(1)
in-context learning
(1)
language understanding
(1)
Papers
Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?
EMNLP 2025
When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings
ICLR 2025
Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI
ICML 2025
MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces
ICML 2025
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
NAACL 2025
PhyloLM: Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks
ICLR 2025
Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement Learning
NIPS 2024
ACES: Generating a Diversity of Challenging Programming Puzzles with Autotelic Generative Models
NIPS 2024
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
ICML 2023
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
ACL 2023
Language-biased image classification: evaluation based on semantic representations
ICLR 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
NIPS 2022
Learning to Guide and to be Guided in the Architect-Builder Problem
ICLR 2022
Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language
ICML 2022
Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning
JMLR 2022
Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents
NAACL 2022
Grounding Language to Autonomously-Acquired Skills via Goal Generation
ICLR 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
ICML 2021
Grounding Spatio-Temporal Language with Transformers
NIPS 2021
Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems
NIPS 2020
Intrinsically Motivated Discovery of Diverse Patterns in Self-Organizing Systems
ICLR 2020
Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration
NIPS 2020
Automatic Curriculum Learning For Deep RL: A Short Survey
IJCAI 2020
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
ICML 2019
Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments
CORL 2019
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
CORL 2018
Sim-to-Real Transfer with Neural-Augmented Robot Simulation
CORL 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
ICML 2018
Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration
ICLR 2018
Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress
NIPS 2012