Debmalya Mandal

22 papers · 2016–2025 · 6 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌍 Conference Polyglot (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (9)

🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (6) 🏃 Academic Marathon (9) 🤝 Dynamic Duo (12) 🧬 Topic Evolution 💎 Century Club (22) 🗃️ Keyword Collector (81) ⚡ Prolific Year (7) 🔥 Unstoppable (7)

Conferences

AISTATS (6) NIPS (6) AAAI (4) ICML (3) IJCAI (2) UAI (1)

Top co-authors

Goran Radanovic (12) Adish Singla (5) Andi Nika (4) Long Tran-Thanh (4) Parameswaran Kamalaruban (3) Stelios Triantafyllou (3) The Anh Ta (2) Hau Chan (2) Jiarui Gan (2) Samuel Deng (2)

Keywords

regret bound (3) ground truth recovery (2) performative reinforcement learning (2) reinforcement learning (2) value iteration (2) online learning (2) episodic learning (1) transfer learning (1) few-shot learning (1) policy optimization (1) gradient estimation (1) game theory (1) convergence analysis (1) sample complexity (1) robust optimization (1) mechanism design (1) dynamic regret (1) model selection (1) corruption robustness (1) communication complexity (1)

Papers

Corruption Robust Offline Reinforcement Learning with Human Feedback AISTATS 2025 Performative Reinforcement Learning with Linear Markov Decision Process AISTATS 2025 Policy Teaching via Data Poisoning in Learning from Human Preferences AISTATS 2025 On Corruption-Robustness in Performative Reinforcement Learning AAAI 2025 Independent Learning in Performative Markov Potential Games AISTATS 2025 Performative Reinforcement Learning in Gradually Shifting Environments UAI 2024 The Surprising Effectiveness of SP Voting with Partial Preferences NIPS 2024 Learning the Expected Core of Strictly Convex Stochastic Cooperative Games NIPS 2024 Symmetric Linear Bandits with Hidden Symmetry NIPS 2024 Corruption-Robust Offline Two-Player Zero-Sum Markov Games AISTATS 2024 Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences ICML 2024 Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs ICML 2024 Online Reinforcement Learning with Uncertain Episode Lengths AAAI 2023 Markov Decision Processes with Time-Varying Geometric Discounting AAAI 2023 Performative Reinforcement Learning ICML 2023 Sequential Blocked Matching AAAI 2022 Learning Tensor Representations for Meta-Learning AISTATS 2022 Surprisingly Popular Voting Recovers Rankings, Surprisingly! IJCAI 2021 Adversarial Blocking Bandits NIPS 2020 Ensuring Fairness Beyond the Training Data NIPS 2020 Efficient and Thrifty Voting by Any Means Necessary NIPS 2019 Correlated Voting IJCAI 2016