Ramki Gummadi
8 papers · 2018–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
๐ Interdisciplinary Bridge ๐งญ Keyword Pioneer ๐ Conference Polyglot (5) ๐ Academic Marathon (6) ๐ Cross-Pollinator (14)
๐
Renaissance Researcher
(6)
๐บ๏ธ
Taxonomy Completionist
(18)
๐ฃ
Hot Topic Early Bird
Conferences
ICML (3)
AISTATS (2)
ICLR (1)
NIPS (1)
NSDI (1)
Top co-authors
Keywords
reinforcement learning
(2)
sample efficiency
(1)
variational inference
(1)
policy gradient
(1)
convergence analysis
(1)
sample complexity
(1)
policy learning
(1)
marginal likelihood
(1)
markov decision process
(1)
gradient descent
(1)
machine learning
(1)
importance weighting
(1)
cost-sensitive classification
(1)
mixing time
(1)
contextual bandit
(1)
surrogate objective
(1)
gradient estimator
(1)
bias function
(1)
proximal policy optimization
(1)
stackelberg game
(1)
Papers
Feasible $Q$-Learning for Average Reward Reinforcement Learning
AISTATS 2024
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
ICML 2024
HALP: Heuristic Aided Learned Preference Eviction Policy for YouTube Content Delivery Network
NSDI 2023
A Parametric Class of Approximate Gradient Updates for Policy Optimization
ICML 2022
Understanding and Leveraging Overparameterization in Recursive Value Estimation
ICLR 2022
Characterizing the Gap Between Actor-Critic and Policy Gradient
ICML 2021
Surrogate Objectives for Batch Policy Optimization in One-step Decision Making
NIPS 2019
Variational Rejection Sampling
AISTATS 2018