conftrace_

Papers

Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries ICML 2025 DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback AAAI 2025 Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data AAAI 2025 Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards ICLR 2025 INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer ICML 2024 Revisiting Data Augmentation in Deep Reinforcement Learning ICLR 2024 Solving Complex Manipulation Tasks with Model-Assisted Model-Free Reinforcement Learning CORL 2022 CVaR-Regret Bounds for Multi-armed Bandits ACML 2022 Neuro-Symbolic Hierarchical Rule Induction ICML 2022 Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning ICML 2021 Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards ICML 2020 Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains IJCAI 2019 Multi-objective Bandits: Optimizing the Generalized Gini Index ICML 2017 Optimization of Probabilistic Argumentation with Markov Decision Models IJCAI 2015 Qualitative Multi-Armed Bandits: A Quantile-Based Approach ICML 2015 Solving MDPs with Skew Symmetric Bilinear Utility Functions IJCAI 2015 Interactive Value Iteration for Markov Decision Processes with Unknown Rewards IJCAI 2013 Top-k Selection based on Adaptive Sampling of Noisy Preferences ICML 2013