Mohammad Sadegh Talebi
11 papers · 2018–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (7) π Academic Marathon (7)
π
Cross-Pollinator
(14)
πΊοΈ
Taxonomy Completionist
(13)
π
Century Club
(11)
Conferences
NIPS (3)
ACML (2)
AISTATS (2)
ALT (1)
ICLR (1)
ICML (1)
UAI (1)
Top co-authors
Keywords
regret bound
(4)
regret minimization
(3)
markov decision process
(3)
reinforcement learning
(2)
sample complexity
(1)
policy learning
(1)
automata learning
(1)
value iteration
(1)
model-based reinforcement learning
(1)
markov chain
(1)
concentration inequality
(1)
confidence set
(1)
bandit algorithm
(1)
average reward
(1)
model-based algorithm
(1)
adversarial bandit
(1)
reward machine
(1)
decision process
(1)
variance analysis
(1)
regular decision process
(1)
Papers
Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics
ICLR 2025
Differentially Private No-regret Exploration in Adversarial Markov Decision Processes
UAI 2024
Exploration in Reward Machines with Low Regret
AISTATS 2023
Provably Efficient Offline Reinforcement Learning in Regular Decision Processes
NIPS 2023
Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits
ACML 2023
Improved Exploration in Factored Average-Reward MDPs
AISTATS 2021
Tightening Exploration in Upper Confidence Reinforcement Learning
ICML 2020
Adversarial Bandits with Corruptions: Regret Lower Bound and No-regret Algorithm
NIPS 2020
Learning Multiple Markov Chains via Adaptive Allocation
NIPS 2019
Model-Based Reinforcement Learning Exploiting State-Action Equivalence
ACML 2019
Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs
ALT 2018