Debmalya Mandal
22 papers · 2016–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
🌍 Conference Polyglot (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (9)
🌈
Renaissance Researcher
(6)
🌍
Conference Polyglot
(6)
🏃
Academic Marathon
(9)
🤝
Dynamic Duo
(12)
🧬
Topic Evolution
💎
Century Club
(22)
🗃️
Keyword Collector
(81)
⚡
Prolific Year
(7)
🔥
Unstoppable
(7)
Conferences
AISTATS (6)
NIPS (6)
AAAI (4)
ICML (3)
IJCAI (2)
UAI (1)
Top co-authors
Keywords
regret bound
(3)
ground truth recovery
(2)
performative reinforcement learning
(2)
reinforcement learning
(2)
value iteration
(2)
online learning
(2)
episodic learning
(1)
transfer learning
(1)
few-shot learning
(1)
policy optimization
(1)
gradient estimation
(1)
game theory
(1)
convergence analysis
(1)
sample complexity
(1)
robust optimization
(1)
mechanism design
(1)
dynamic regret
(1)
model selection
(1)
corruption robustness
(1)
communication complexity
(1)
Papers
Corruption Robust Offline Reinforcement Learning with Human Feedback
AISTATS 2025
Performative Reinforcement Learning with Linear Markov Decision Process
AISTATS 2025
Policy Teaching via Data Poisoning in Learning from Human Preferences
AISTATS 2025
On Corruption-Robustness in Performative Reinforcement Learning
AAAI 2025
Independent Learning in Performative Markov Potential Games
AISTATS 2025
Performative Reinforcement Learning in Gradually Shifting Environments
UAI 2024
The Surprising Effectiveness of SP Voting with Partial Preferences
NIPS 2024
Learning the Expected Core of Strictly Convex Stochastic Cooperative Games
NIPS 2024
Symmetric Linear Bandits with Hidden Symmetry
NIPS 2024
Corruption-Robust Offline Two-Player Zero-Sum Markov Games
AISTATS 2024
Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences
ICML 2024
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs
ICML 2024
Online Reinforcement Learning with Uncertain Episode Lengths
AAAI 2023
Markov Decision Processes with Time-Varying Geometric Discounting
AAAI 2023
Performative Reinforcement Learning
ICML 2023
Sequential Blocked Matching
AAAI 2022
Learning Tensor Representations for Meta-Learning
AISTATS 2022
Surprisingly Popular Voting Recovers Rankings, Surprisingly!
IJCAI 2021
Adversarial Blocking Bandits
NIPS 2020
Ensuring Fairness Beyond the Training Data
NIPS 2020
Efficient and Thrifty Voting by Any Means Necessary
NIPS 2019
Correlated Voting
IJCAI 2016