Dhawal Gupta
6 papers · 2020–2024 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (4)
Keyword Pioneer
Hot Topic Early Bird
Interdisciplinary Bridge
Cross-Pollinator (6)
Taxonomy Completionist (11)
Grand Slam
Conferences
NIPS (3)
AAAI (1)
ICLR (1)
ICML (1)
Keywords
temporal difference learning (1)
policy evaluation (1)
neural network training (1)
offline reinforcement learning (1)
policy optimization (1)
stability analysis (1)
reward function (1)
temporal-difference learning (1)
off-policy learning (1)
language model (1)
mixture of expert (1)
bi-level optimization (1)
reward shaping (1)
credit assignment (1)
conversational agent (1)
gradient method (1)
nonlinear function approximation (1)
eligibility trace (1)
control problem (1)
dialogue management (1)
Papers
From Past to Future: Rethinking Eligibility Traces
AAAI 2024
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
NIPS 2023
Behavior Alignment via Reward Function Optimization
NIPS 2023
A Mixture-of-Expert Approach to RL-based Dialogue Management
ICLR 2023
Structural Credit Assignment in Neural Networks using Reinforcement Learning
NIPS 2021
Gradient Temporal-Difference Learning with Regularized Corrections
ICML 2020