conftrace_

Thorsten Joachims

30 papers · 2005–2025 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+11 more ↓ 🧭 Keyword Pioneer πŸŒ‰ Interdisciplinary Bridge 🌈 Renaissance Researcher (6) πŸ—ΊοΈ Taxonomy Completionist (17) 🐣 Hot Topic Early Bird
πŸƒ Academic Marathon (20) 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🌟 Keyword Trendsetter Combo (5) 🌱 Topic Pioneer πŸ† Keyword Champion πŸ’Ž Century Club (30) πŸ“ˆ Trend Setter πŸš€ Conference Pioneer πŸ”₯ Unstoppable (8) πŸ—ƒοΈ Keyword Collector (50)

Conferences

ICML (10) NIPS (7) AISTATS (2) EMNLP (2) ICLR (2) IJCAI (2) JMLR (2) EACL (1) ICCV (1) UAI (1)

Papers

POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition ICLR 2025 REBEL: Reinforcement Learning via Regressing Relative Rewards NIPS 2024 Coactive Learning for Large Language Models using Implicit User Feedback ICML 2024 Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling ICML 2023 Boosted Off-Policy Learning AISTATS 2023 Bandits with costly reward observations UAI 2023 Improving Screening Processes via Calibrated Subset Selection ICML 2022 Off-Policy Evaluation for Large Action Spaces via Embeddings ICML 2022 Fairness in Ranking under Uncertainty NIPS 2021 Controlling Fairness and Bias in Dynamic Learning-to-Rank (Extended Abstract) IJCAI 2021 Fairness of Exposure in Stochastic Bandits ICML 2021 MOReL: Model-Based Offline Reinforcement Learning NIPS 2020 CAB: Continuous Adaptive Blending for Policy Evaluation and Learning ICML 2019 Policy Learning for Fairness in Ranking NIPS 2019 Deep Learning with Logged Bandit Feedback ICLR 2018 Unbiased Learning-to-Rank with Biased Feedback IJCAI 2018 Recommendations as Treatments: Debiasing Learning and Evaluation ICML 2016 Evaluation methods for unsupervised word embeddings EMNLP 2015 The Self-Normalized Estimator for Counterfactual Learning NIPS 2015 Counterfactual Risk Minimization: Learning from Logged Bandit Feedback ICML 2015 Batch Learning from Logged Bandit Feedback through Counterfactual Risk Minimization JMLR 2015 Invited Talk: Learning from Rational Behavior EMNLP 2014 Reducing Dueling Bandits to Cardinal Bandits ICML 2014 Stable Coactive Learning via Perturbation ICML 2013 Learning Trajectory Preferences for Manipulators via Iterative Improvement NIPS 2013 Structured Learning of Sum-of-Submodular Higher Order Energy Functions ICCV 2013 Large-Margin Learning of Submodular Summarization Models EACL 2012 Multi-armed Bandit Problems with History AISTATS 2012 Semantic Labeling of 3D Point Clouds for Indoor Scenes NIPS 2011 Large Margin Methods for Structured and Interdependent Output Variables JMLR 2005