Pierluca D'Oro
9 papers · 2020–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (4) π Academic Marathon (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
β
The Questioner
Conferences
ICLR (5)
AAAI (2)
ICCV (1)
NIPS (1)
Top co-authors
Keywords
model-based reinforcement learning
(2)
policy optimization
(2)
policy gradient
(2)
visual grounding
(1)
regret minimization
(1)
importance sampling
(1)
policy search
(1)
multimodal large language model
(1)
randomized exploration
(1)
transition model
(1)
object hallucination
(1)
model learning
(1)
model-based rl
(1)
controlled decoding
(1)
batch policy improvement
(1)
reward-guided decoding
(1)
mediator feedback
(1)
reinforcement learning
(1)
action-value gradient
(1)
temporal difference learning
(1)
Papers
Towards General-Purpose Model-Free Reinforcement Learning
ICLR 2025
MaestroMotif: Skill Design from Artificial Intelligence Feedback
ICLR 2025
Controlling Multimodal LLMs via Reward-guided Decoding
ICCV 2025
The Curse of Diversity in Ensemble-Based Exploration
ICLR 2024
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
ICLR 2024
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
ICLR 2023
Policy Optimization as Online Learning with Mediator Feedback
AAAI 2021
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
NIPS 2020
Gradient-Aware Model-Based Policy Search
AAAI 2020