Craig Boutilier
42 papers · 2010–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (14) π Interdisciplinary Bridge π Conference Polyglot (8)
π£
Hot Topic Early Bird
π
Conference Polyglot
(8)
π
Academic Marathon
(15)
π
Grand Slam
π§¬
Topic Evolution
π€
Dynamic Duo
(10)
π
Keyword Champion
(2)
π
Triple Crown
ποΈ
Keyword Collector
(142)
β‘
Prolific Year
(8)
π
Conference Pioneer
π
Century Club
(41)
π₯
Unstoppable
(9)
π
Trend Setter
Conferences
IJCAI (15)
NIPS (10)
ICML (5)
ICLR (4)
AAAI (3)
AISTATS (2)
EACL (1)
JMLR (1)
UAI (1)
Top co-authors
Keywords
recommender system
(5)
regret bound
(5)
reinforcement learning
(4)
thompson sampling
(4)
user modeling
(3)
recommendation system
(3)
multi-armed bandit
(3)
model-based reinforcement learning
(3)
markov decision process
(3)
policy gradient
(3)
expected value of information
(3)
value iteration
(3)
policy optimization
(2)
contextual bandit
(2)
exploration strategy
(2)
social welfare
(2)
bayesian inference
(2)
policy learning
(2)
regret minimization
(2)
online algorithm
(2)
Papers
ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders
EACL 2026
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
ICLR 2025
Preference Adaptive and Sequential Text-to-Image Generation
ICML 2025
Demystifying Embedding Spaces using Large Language Models
ICLR 2024
Embedding-Aligned Language Models
NIPS 2024
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
NIPS 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
NIPS 2024
Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization
AAAI 2024
Model-Free Preference Elicitation
IJCAI 2024
A Mixture-of-Expert Approach to RL-based Dialogue Management
ICLR 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
NIPS 2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
NIPS 2023
Reinforcement Learning with History Dependent Dynamic Contexts
ICML 2023
Thompson Sampling with a Mixture Prior
AISTATS 2022
Subjective Attributes in Conversational Recommendation Systems: Challenges and Opportunities
AAAI 2022
IMO^3: Interactive Multi-Objective Off-Policy Optimization
IJCAI 2022
Meta-Thompson Sampling
ICML 2021
BRPO: Batch Residual Policy Optimization
IJCAI 2020
Differentiable Meta-Learning of Bandit Policies
NIPS 2020
Latent Bandits Revisited
NIPS 2020
Gradient-Based Optimization for Bayesian Preference Elicitation
AAAI 2020
Randomized Exploration in Generalized Linear Bandits
AISTATS 2020
CAQL: Continuous Action Q-Learning
ICLR 2020
Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
ICML 2020
ConQUR: Mitigating Delusional Bias in Deep Q-Learning
ICML 2020
SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets
IJCAI 2019
Perturbed-History Exploration in Stochastic Linear Bandits
UAI 2019
Advantage Amplification in Slowly Evolving Latent-State Environments
IJCAI 2019
Perturbed-History Exploration in Stochastic Multi-Armed Bandits
IJCAI 2019
Data center cooling using model-predictive control
NIPS 2018
Planning and Learning with Stochastic Action Sets
IJCAI 2018
Non-delusional Q-learning and value-iteration
NIPS 2018
Logistic Markov Decision Processes
IJCAI 2017
Multiple-Profile Prediction-of-Use Games
IJCAI 2017
Approximately Stable Pricing for Coordinated Purchasing of Electricity
IJCAI 2015
Effective Sampling and Learning for Mallows Models with Pairwise-Preference Data
JMLR 2014
Multi-Winner Social Choice with Incomplete Preferences
IJCAI 2013
Elicitation and Approximately Stable Matching with Partial Preferences
IJCAI 2013
Multi-Dimensional Single-Peaked Consistency and Its Approximations
IJCAI 2013
Analysis and Optimization of Multi-Dimensional Percentile Mechanisms
IJCAI 2013
Efficient Vote Elicitation under Candidate Uncertainty
IJCAI 2013
Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets
NIPS 2010