conftrace_

Paavo Parmas

6 papers · 2018–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🏃 Academic Marathon (7) 🐝 Cross-Pollinator (10)

🗺️ Taxonomy Completionist (12) 🌉 Interdisciplinary Bridge 🏆 Keyword Champion (4)

Conferences

ICML (2) NIPS (2) AISTATS (1) ICLR (1)

Top co-authors

Takuma Seno (2) Yohei Hosoe (1) Tadashi Kozuno (1) Carl Edward Rasmussen (1) Toshinori Kitamura (1) Masashi Sugiyama (1) Yutaka Matsuo (1) Jan Peters (1) Kenta Hoshino (1) Masashi Hamaya (1)

Keywords

likelihood ratio gradient (4) reparameterization gradient (4) gradient estimation (3) model-based reinforcement learning (2) policy gradient (2) monte carlo estimation (1) importance sampling (1) message passing (1) policy search (1) particle filter (1) variance reduction (1) likelihood ratio (1) graphical model (1) gradient estimator (1) monte carlo estimator (1) exploding gradient (1) deep reinforcement learning (1) monte carlo gradient (1) stochastic gradient (1) variational inference (1)

Papers

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form ICLR 2025 Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators ICML 2023 Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms NIPS 2022 A unified view of likelihood ratio and reparameterization gradients AISTATS 2021 Total stochastic gradient algorithms and applications in reinforcement learning NIPS 2018 PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos ICML 2018