Prashanth L.A.

11 papers · 2013–2024 · 6 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🌍 Conference Polyglot (6) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (11)

🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Keyword Champion (4) 📈 Trend Setter 🔥 Unstoppable (6) 💎 Century Club (11) 🗃️ Keyword Collector (57)

Conferences

AISTATS (3) ICML (3) NIPS (2) AAAI (1) IJCAI (1) JMLR (1)

Top co-authors

Sanjay P. Bhat (3) Krishna Jagannathan (2) Csaba Szepesvári (2) Ravi Kolla (1) Gandharv Patil (1) Cheng Jie (1) Mizhaan P. Maniyar (1) Shalabh Bhatnagar (1) Dheeraj Nagaraj (1) Steve Marcus (1)

Keywords

concentration bound (4) wasserstein distance (2) value function (2) stochastic bandit (2) policy optimization (2) reinforcement learning (2) multi-armed bandit (2) conditional value-at-risk (2) cumulative prospect theory (2) conditional value at risk (2) risk measure (2) regret minimization (1) markov decision process (1) exploration-exploitation tradeoff (1) sample complexity (1) markov decision processes (1) sequential decision making (1) best arm identification (1) policy gradient (1) sequential decision-making (1)

Papers

A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning AISTATS 2024 Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation AISTATS 2023 A Wasserstein Distance Approach for Concentration of Empirical Risk Estimates JMLR 2022 A Survey of Risk-Aware Multi-Armed Bandits IJCAI 2022 Estimation of Spectral Risk Measures AAAI 2021 Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions ICML 2020 Correlated bandits or: How to minimize mean-squared error online ICML 2019 Concentration of risk measures: A Wasserstein distance approach NIPS 2019 Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control ICML 2016 (Bandit) Convex Optimization with Biased Noisy Gradient Oracles AISTATS 2016 Actor-Critic Algorithms for Risk-Sensitive MDPs NIPS 2013