Tal Lancewicki
10 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
Hot Topic Early Bird
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (5)
Cross-Pollinator (10)
Taxonomy Completionist (17)
Keyword Champion (2)
Century Club (10)
Conferences
ICML (5)
AAAI (2)
COLT (1)
JMLR (1)
NeurIPS (1)
Keywords
regret bound (8)
delayed feedback (5)
markov decision process (5)
adversarial mdp (3)
online learning (3)
multi-armed bandit (3)
stochastic optimization (2)
combinatorial semi-bandit (2)
policy optimization (2)
reinforcement learning (2)
multi-agent learning (2)
bandit feedback (2)
linear bandit (2)
adversarial learning (2)
follow the regularized leader (2)
optimality gap (1)
deep reinforcement learning (1)
function approximation (1)
regret minimization (1)
correlated equilibrium (1)
Papers
Near-optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback
ICML 2025
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs
JMLR 2025
Delay as Payoff in MAB
AAAI 2025
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
ICML 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
ICML 2023
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs
COLT 2023
Cooperative Online Learning in Stochastic and Adversarial MDPs
ICML 2022
Learning Adversarial Markov Decision Processes with Delayed Feedback
AAAI 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
NeurIPS 2022
Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions
ICML 2021