Paavo Parmas
6 papers · 2018–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Conference Polyglot (4) π Academic Marathon (7) π Cross-Pollinator (10)
πΊοΈ
Taxonomy Completionist
(12)
π
Interdisciplinary Bridge
π
Keyword Champion
(4)
Conferences
ICML (2)
NIPS (2)
AISTATS (1)
ICLR (1)
Top co-authors
Keywords
likelihood ratio gradient
(4)
reparameterization gradient
(4)
gradient estimation
(3)
model-based reinforcement learning
(2)
policy gradient
(2)
monte carlo estimation
(1)
importance sampling
(1)
message passing
(1)
policy search
(1)
particle filter
(1)
variance reduction
(1)
likelihood ratio
(1)
graphical model
(1)
gradient estimator
(1)
monte carlo estimator
(1)
exploding gradient
(1)
deep reinforcement learning
(1)
monte carlo gradient
(1)
stochastic gradient
(1)
variational inference
(1)
Papers
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
ICLR 2025
Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators
ICML 2023
Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms
NIPS 2022
A unified view of likelihood ratio and reparameterization gradients
AISTATS 2021
Total stochastic gradient algorithms and applications in reinforcement learning
NIPS 2018
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos
ICML 2018