Craig Boutilier

42 papers · 2010–2026 · 9 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (14) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8)

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (8) 🏃 Academic Marathon (15) 🏆 Grand Slam 🧬 Topic Evolution 🤝 Dynamic Duo (10) 🏆 Keyword Champion (2) 👑 Triple Crown 🗃️ Keyword Collector (142) ⚡ Prolific Year (8) 🚀 Conference Pioneer 💎 Century Club (41) 🔥 Unstoppable (9) 📈 Trend Setter

Conferences

IJCAI (15) NIPS (10) ICML (5) ICLR (4) AAAI (3) AISTATS (2) EACL (1) JMLR (1) UAI (1)

Top co-authors

Yinlam Chow (10) Martin Mladenov (9) Guy Tennenholtz (8) Branislav Kveton (8) Mohammad Ghavamzadeh (7) Tyler Lu (7) Ofer Meshi (6) Manzil Zaheer (5) Csaba Szepesvári (5) Dale Schuurmans (5)

Keywords

recommender system (5) regret bound (5) reinforcement learning (4) thompson sampling (4) user modeling (3) recommendation system (3) multi-armed bandit (3) model-based reinforcement learning (3) markov decision process (3) policy gradient (3) expected value of information (3) value iteration (3) policy optimization (2) contextual bandit (2) exploration strategy (2) social welfare (2) bayesian inference (2) policy learning (2) regret minimization (2) online algorithm (2)

Papers

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders EACL 2026 Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models ICLR 2025 Preference Adaptive and Sequential Text-to-Image Generation ICML 2025 Demystifying Embedding Spaces using Large Language Models ICLR 2024 Embedding-Aligned Language Models NIPS 2024 Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval NIPS 2024 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning NIPS 2024 Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization AAAI 2024 Model-Free Preference Elicitation IJCAI 2024 A Mixture-of-Expert Approach to RL-based Dialogue Management ICLR 2023 DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models NIPS 2023 Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management NIPS 2023 Reinforcement Learning with History Dependent Dynamic Contexts ICML 2023 Thompson Sampling with a Mixture Prior AISTATS 2022 Subjective Attributes in Conversational Recommendation Systems: Challenges and Opportunities AAAI 2022 IMO^3: Interactive Multi-Objective Off-Policy Optimization IJCAI 2022 Meta-Thompson Sampling ICML 2021 BRPO: Batch Residual Policy Optimization IJCAI 2020 Differentiable Meta-Learning of Bandit Policies NIPS 2020 Latent Bandits Revisited NIPS 2020 Gradient-Based Optimization for Bayesian Preference Elicitation AAAI 2020 Randomized Exploration in Generalized Linear Bandits AISTATS 2020 CAQL: Continuous Action Q-Learning ICLR 2020 Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach ICML 2020 ConQUR: Mitigating Delusional Bias in Deep Q-Learning ICML 2020 SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets IJCAI 2019 Perturbed-History Exploration in Stochastic Linear Bandits UAI 2019 Advantage Amplification in Slowly Evolving Latent-State Environments IJCAI 2019 Perturbed-History Exploration in Stochastic Multi-Armed Bandits IJCAI 2019 Data center cooling using model-predictive control NIPS 2018 Planning and Learning with Stochastic Action Sets IJCAI 2018 Non-delusional Q-learning and value-iteration NIPS 2018 Logistic Markov Decision Processes IJCAI 2017 Multiple-Profile Prediction-of-Use Games IJCAI 2017 Approximately Stable Pricing for Coordinated Purchasing of Electricity IJCAI 2015 Effective Sampling and Learning for Mallows Models with Pairwise-Preference Data JMLR 2014 Multi-Winner Social Choice with Incomplete Preferences IJCAI 2013 Elicitation and Approximately Stable Matching with Partial Preferences IJCAI 2013 Multi-Dimensional Single-Peaked Consistency and Its Approximations IJCAI 2013 Analysis and Optimization of Multi-Dimensional Percentile Mechanisms IJCAI 2013 Efficient Vote Elicitation under Candidate Uncertainty IJCAI 2013 Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets NIPS 2010