conftrace_

Aviv Rosenberg

20 papers · 2019–2025 · 7 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (7) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (6)
🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (23) 🏆 Grand Slam 💎 Century Club (20) 🔥 Unstoppable (7) 🗃️ Keyword Collector (65)

Conferences

NIPS (7) ICML (6) AAAI (2) COLT (2) ICLR (1) IJCAI (1) JMLR (1)

Papers

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs · JMLR 2025
Building Math Agents with Multi-Turn Iterative Preference Learning · ICLR 2025
Multi-turn Reinforcement Learning with Preference Human Feedback · NIPS 2024
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback · ICML 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes · NIPS 2024
Online Weighted Paging with Unknown Weights · NIPS 2024
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs · COLT 2023
Planning and Learning with Adaptive Lookahead · AAAI 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback · ICML 2023
Cooperative Online Learning in Stochastic and Adversarial MDPs · ICML 2022
Policy Optimization for Stochastic Shortest Path · COLT 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback · NIPS 2022
Learning Adversarial Markov Decision Processes with Delayed Feedback · AAAI 2022
Stochastic Shortest Path with Adversarially Changing Costs · IJCAI 2021
Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure · NIPS 2021
Minimax Regret for Stochastic Shortest Path · NIPS 2021
Optimistic Policy Optimization with Bandit Feedback · ICML 2020
Near-optimal Regret Bounds for Stochastic Shortest Path · ICML 2020
Online Convex Optimization in Adversarial Markov Decision Processes · ICML 2019
Online Stochastic Shortest Path with Bandit Feedback and Unknown Transition Function · NIPS 2019