conftrace_

Craig Boutilier

42 papers · 2010–2026 · 9 conferences · across top CS/AI conferences

Achievements

🧭 Keyword Pioneer · 🐣 Hot Topic Early Bird · 🗺️ Taxonomy Completionist (14) · 🌉 Interdisciplinary Bridge · 🌍 Conference Polyglot (8) · 🏃 Academic Marathon (15) · 🏆 Grand Slam · 🧬 Topic Evolution · 🤝 Dynamic Duo (10) · 🏆 Keyword Champion (2) · 👑 Triple Crown · 🗃️ Keyword Collector (142) · ⚡ Prolific Year (8) · 🚀 Conference Pioneer · 💎 Century Club (41) · 🔥 Unstoppable (9) · 📈 Trend Setter

Conferences

IJCAI (15) NIPS (10) ICML (5) ICLR (4) AAAI (3) AISTATS (2) EACL (1) JMLR (1) UAI (1)

Papers

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders · EACL 2026
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models · ICLR 2025
Preference Adaptive and Sequential Text-to-Image Generation · ICML 2025
Demystifying Embedding Spaces using Large Language Models · ICLR 2024
Embedding-Aligned Language Models · NIPS 2024
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval · NIPS 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning · NIPS 2024
Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization · AAAI 2024
Model-Free Preference Elicitation · IJCAI 2024
A Mixture-of-Expert Approach to RL-based Dialogue Management · ICLR 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models · NIPS 2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management · NIPS 2023
Reinforcement Learning with History Dependent Dynamic Contexts · ICML 2023
Thompson Sampling with a Mixture Prior · AISTATS 2022
Subjective Attributes in Conversational Recommendation Systems: Challenges and Opportunities · AAAI 2022
IMO^3: Interactive Multi-Objective Off-Policy Optimization · IJCAI 2022
Meta-Thompson Sampling · ICML 2021
BRPO: Batch Residual Policy Optimization · IJCAI 2020
Differentiable Meta-Learning of Bandit Policies · NIPS 2020
Latent Bandits Revisited · NIPS 2020
Gradient-Based Optimization for Bayesian Preference Elicitation · AAAI 2020
Randomized Exploration in Generalized Linear Bandits · AISTATS 2020
CAQL: Continuous Action Q-Learning · ICLR 2020
Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach · ICML 2020
ConQUR: Mitigating Delusional Bias in Deep Q-Learning · ICML 2020
SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets · IJCAI 2019
Perturbed-History Exploration in Stochastic Linear Bandits · UAI 2019
Advantage Amplification in Slowly Evolving Latent-State Environments · IJCAI 2019
Perturbed-History Exploration in Stochastic Multi-Armed Bandits · IJCAI 2019
Data center cooling using model-predictive control · NIPS 2018
Planning and Learning with Stochastic Action Sets · IJCAI 2018
Non-delusional Q-learning and value-iteration · NIPS 2018
Logistic Markov Decision Processes · IJCAI 2017
Multiple-Profile Prediction-of-Use Games · IJCAI 2017
Approximately Stable Pricing for Coordinated Purchasing of Electricity · IJCAI 2015
Effective Sampling and Learning for Mallows Models with Pairwise-Preference Data · JMLR 2014
Multi-Winner Social Choice with Incomplete Preferences · IJCAI 2013
Elicitation and Approximately Stable Matching with Partial Preferences · IJCAI 2013
Multi-Dimensional Single-Peaked Consistency and Its Approximations · IJCAI 2013
Analysis and Optimization of Multi-Dimensional Percentile Mechanisms · IJCAI 2013
Efficient Vote Elicitation under Candidate Uncertainty · IJCAI 2013
Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets · NIPS 2010