conftrace_

Dhawal Gupta

6 papers · 2020–2024 · 4 top CS/AI conferences

Achievements

🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (6)

🗺️ Taxonomy Completionist (11) 🏆 Grand Slam

Conferences

NIPS (3) AAAI (1) ICLR (1) ICML (1)

Top co-authors

Philip S. Thomas (3) Martha White (2) Yinlam Chow (2) Azamat Tulepbergenov (2) Mohammad Ghavamzadeh (2) Craig Boutilier (2) James Kostas (1) Bruno Castro da Silva (1) Bo Liu (1) Scott M. Jordan (1)

Keywords

temporal difference learning (1) policy evaluation (1) neural network training (1) offline reinforcement learning (1) policy optimization (1) stability analysis (1) reward function (1) temporal-difference learning (1) off-policy learning (1) language model (1) mixture of expert (1) bi-level optimization (1) reward shaping (1) credit assignment (1) conversational agent (1) gradient method (1) nonlinear function approximation (1) eligibility trace (1) control problem (1) dialogue management (1)

Papers

From Past to Future: Rethinking Eligibility Traces · AAAI 2024
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management · NIPS 2023
Behavior Alignment via Reward Function Optimization · NIPS 2023
A Mixture-of-Expert Approach to RL-based Dialogue Management · ICLR 2023
Structural Credit Assignment in Neural Networks using Reinforcement Learning · NIPS 2021
Gradient Temporal-Difference Learning with Regularized Corrections · ICML 2020