Milind Tambe
78 papers · 2013–2026 · 8 top CS/AI conferences
Achievements
Taxonomy Completionist (12) · Keyword Pioneer · Interdisciplinary Bridge · Renaissance Researcher (6) · Conference Polyglot (8)
Interdisciplinary Bridge
Hot Topic Early Bird
Taxonomy Completionist (12)
Conference Loyalist (27)
Dynamic Duo (18)
Topic Evolution
Keyword Champion (2)
Grand Slam
Deep Specialist (17)
Conference Pioneer
Unstoppable (13)
Prolific Year (8)
The Questioner (2)
Trend Setter
Keyword Collector (286)
Century Club (74)
Conferences
AAAI (31)
IJCAI (23)
NIPS (9)
UAI (8)
ICML (4)
ICLR (1)
MLHC (1)
WACV (1)
Top co-authors
Research topics
Keywords
restless multi-armed bandit (15)
resource allocation (14)
decision-focused learning (6)
social network (6)
influence maximization (6)
public health (6)
maternal health (5)
markov decision process (5)
reinforcement learning (4)
healthcare intervention (4)
whittle index (4)
combinatorial optimization (4)
robust optimization (4)
game theory (4)
mobile health (3)
large language model (3)
security game (3)
minimax regret (3)
multi-armed bandit (3)
health intervention (3)
Papers
VORTEX: Aligning Task Utility and Human Preferences Through LLM-Guided Reward Shaping
AAAI 2026
Preference Robustness for DPO with Applications to Public Health
AAAI 2026
Optimizing Health Coverage in Ethiopia: A Learning-augmented Approach and Persistent Proportionality Under an Online Budget
AAAI 2026
Generative AI Against Poaching: Latent Composite Flow Matching for Poaching Prediction
AAAI 2026
What is the Right Notion of Distance between Predict-then-Optimize Tasks?
UAI 2025
Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits
AAAI 2025
Robust Optimization with Diffusion Models for Green Security
UAI 2025
Optimizing Vital Sign Monitoring in Resource-Constrained Maternal Care: An RL-Based Restless Bandit Approach
AAAI 2025
Learning to Call: A Field Trial of a Collaborative Bandit Algorithm for Optimizing Call Timing in Mobile Maternal Health
MLHC 2025
PRIORITY2REWARD: Incorporating Healthworker Preferences for Resource Allocation Planning
AAAI 2025
Evaluating Index-based Treatment Allocation in Underresourced Communities
AAAI 2025
Navigating the Social Welfare Frontier: Portfolios for Multi-objective Reinforcement Learning
ICML 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
ICLR 2025
The Bandit Whisperer: Communication Learning for Restless Bandits
AAAI 2025
Improving Health Information Access in the World's Largest Maternal Mobile Health Program via Bandit Algorithms
AAAI 2024
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
NIPS 2024
Transcendence: Generative Models Can Outperform The Experts That Train Them
NIPS 2024
Group Fairness in Predict-Then-Optimize Settings for Restless Bandits
UAI 2024
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
IJCAI 2024
Position: Social Environment Design Should be Further Developed for AI-based Policy-Making
ICML 2024
Position: Application-Driven Innovation in Machine Learning
ICML 2024
Leaving the Nest: Going beyond Local Loss Functions for Predict-Then-Optimize
AAAI 2024
Limited Resource Allocation in a Non-Markovian World: The Case of Maternal and Child Healthcare
IJCAI 2023
Find Rhinos without Finding Rhinos: Active Learning with Multimodal Imagery of South African Rhino Habitats
IJCAI 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
AAAI 2023
Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation
AAAI 2023
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health
AAAI 2023
Robust Planning over Restless Groups: Engagement Interventions for a Large-Scale Maternal Telehealth Program
AAAI 2023
Increasing Impact of Mobile Health Programs: SAHELI for Maternal and Child Care
AAAI 2023
Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation
ICML 2023
Complex Contagion Influence Maximization: A Reinforcement Learning Approach
IJCAI 2023
Restless and uncertain: Robust policies for restless bandits via deep multi-agent reinforcement learning
UAI 2022
Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses
NIPS 2022
Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games
AAAI 2022
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-profits in Improving Maternal and Child Health
AAAI 2022
Micronutrient Deficiency Prediction via Publicly Available Satellite Data
AAAI 2022
Using Public Data to Predict Demand for Mobile Health Clinics
AAAI 2022
Facilitating Human-Wildlife Cohabitation through Conflict Prediction
AAAI 2022
Evolutionary Approach to Security Games with Signaling
IJCAI 2022
ADVISER: AI-Driven Vaccination Intervention Optimiser for Increasing Vaccine Uptake in Nigeria
IJCAI 2022
Ranked Prioritization of Groups in Combinatorial Bandit Allocation
IJCAI 2022
Solving structured hierarchical games using differential backward induction
UAI 2022
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
IJCAI 2021
Clinical Trial of an AI-Augmented Intervention for HIV Prevention in Youth Experiencing Homelessness
AAAI 2021
Fair Influence Maximization: a Welfare Optimization Approach
AAAI 2021
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security
AAAI 2021
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning
NIPS 2021
Contingency-aware influence maximization: A reinforcement learning approach
UAI 2021
Robust reinforcement learning under minimax regret for green security
UAI 2021
Tracking Disease Outbreaks from Sparse Data with Bayesian Inference
AAAI 2021
BIRDSAI: A Dataset for Detection and Tracking in Aerial Thermal Infrared Videos
WACV 2020
Collapsing Bandits and Their Application to Public Health Intervention
NIPS 2020
Automatically Learning Compact Quality-aware Surrogates for Optimization Problems
NIPS 2020
Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning
AAAI 2020
MIPaaL: Mixed Integer Program as a Layer
AAAI 2020
End-to-End Game-Focused Learning of Adversary Behavior in Security Games
AAAI 2020
To Signal or Not To Signal: Exploiting Uncertain Real-Time Information in Signaling Games for Security and Sustainability
AAAI 2020
Robust Spatial-Temporal Incident Prediction
UAI 2020
Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization
AAAI 2019
End to end learning and optimization on graphs
NIPS 2019
Group-Fairness in Influence Maximization
IJCAI 2019
Exploring Algorithmic Fairness in Robust Graph Covering Problems
NIPS 2019
On the Inducibility of Stackelberg Equilibrium for Security Games
AAAI 2019
Near Real-Time Detection of Poachers from Drones in AirSim
IJCAI 2018
The Price of Usability: Designing Operationalizable Strategies for Security Games
IJCAI 2018
Bridging the Gap Between Theory and Practice in Influence Maximization: Raising Awareness about HIV among Homeless Youth
IJCAI 2018
Stackelberg Security Games: Looking Beyond a Decade of Success
IJCAI 2018
Maximizing Awareness about HIV in Social Networks of Homeless Youth with Limited Information
IJCAI 2017
Staying Ahead of the Game: Adaptive Robust Optimization for Dynamic Allocation of Threat Screening Resources
IJCAI 2017
Don't Bury your Head in Warnings: A Game-Theoretic Approach for Intelligent Allocation of Cyber-security Alerts
IJCAI 2017
Three Strategies to Success: Learning Adversary Models in Security Games
IJCAI 2016
Security Games with Information Leakage: Modeling and Computation
IJCAI 2015
When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing
IJCAI 2015
Diverse Randomized Agents Vote to Win
NIPS 2014
Defender (Mis)Coordination in Security Games
IJCAI 2013
Efficiently Solving Joint Activity Based Security Games
IJCAI 2013
Scaling-Up Security Games with Boundedly Rational Adversaries: A Cutting-Plane Approach
IJCAI 2013
Multi-Agent Team Formation: Diversity Beats Strength?
IJCAI 2013