Miroslav Pajic
15 papers · 2022–2026 · 6 conferences · across top CS/AI conferences
Achievements
Hot Topic Early Bird
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (5)
Cross-Pollinator (13)
Taxonomy Completionist (19)
Century Club (14)
Unstoppable (5)
Prolific Year (5)
Conferences
L4DC (6)
ICLR (3)
NIPS (3)
AAAI (1)
AISTATS (1)
IJCAI (1)
Keywords
reinforcement learning (4)
langevin monte carlo (2)
off-policy evaluation (2)
time series classification (1)
multivariate time series (1)
discount factor (1)
thompson sampling (1)
markov chain (1)
cooperative multi-agent (1)
concentration inequality (1)
latent space (1)
human feedback (1)
regret bound (1)
state transition (1)
nonlinear dynamical system (1)
bellman equation (1)
cooperative multi-agent system (1)
exploration strategy (1)
adversarial learning (1)
multi-agent reinforcement learning (1)
Papers
Bot Blitz: A Scalable Hands-On Workshop for Teaching AI and Robotics Concepts Through Narrative-Driven Problem Solving
AAAI 2026
Neuro-Symbolic Deadlock Resolution in Multi-Robot Systems
L4DC 2025
Variational Adversarial Training Towards Policies with Improved Robustness
AISTATS 2025
Safe Cooperative Multi-Agent Reinforcement Learning with Function Approximation
L4DC 2025
On the uniqueness of solution for the Bellman equation of LTL objectives
L4DC 2024
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
NIPS 2024
On Trajectory Augmentations for Off-Policy Evaluation
ICLR 2024
Robust exploration with adversary via Langevin Monte Carlo
L4DC 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
NIPS 2024
Off-Policy Evaluation for Human Feedback
NIPS 2023
Variational Latent Branching Model for Off-Policy Evaluation
ICLR 2023
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
L4DC 2023
Resiliency of Perception-Based Controllers Against Attacks
L4DC 2022
A Reinforcement Learning-Informed Pattern Mining Framework for Multivariate Time Series Classification
IJCAI 2022
Gradient Importance Learning for Incomplete Observations
ICLR 2022