Asaf Cassel
9 papers · 2018–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (4) π Academic Marathon (7) π Cross-Pollinator (10) π Renaissance Researcher (6)
πΊοΈ
Taxonomy Completionist
(19)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
Conferences
ICML (4)
NIPS (3)
AAAI (1)
COLT (1)
Top co-authors
Keywords
regret bound
(5)
stochastic bandit
(3)
online learning
(3)
stochastic optimization
(2)
policy optimization
(2)
multi-armed bandit
(2)
regret minimization
(1)
exploration exploitation
(1)
optimal policy
(1)
linear quadratic regulator
(1)
adversarial setting
(1)
bandit linear control
(1)
upper confidence bound
(1)
language model alignment
(1)
conditional value-at-risk
(1)
ensemble method
(1)
preference feedback
(1)
reinforcement learning
(1)
control theory
(1)
bandit algorithm
(1)
Papers
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits
AAAI 2025
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
ICML 2024
Multi-turn Reinforcement Learning with Preference Human Feedback
NIPS 2024
Eluder-based Regret for Stochastic Contextual MDPs
ICML 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
NIPS 2024
Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation
ICML 2023
Bandit Linear Control
NIPS 2020
Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently
ICML 2020
A General Approach to Multi-Armed Bandits Under Risk Criteria
COLT 2018