conftrace
Reinforcement Learning
2,932 papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Reward-Free Policy Space Compression for Reinforcement Learning
AISTATS 2022
Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation
AISTATS 2022
Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
AISTATS 2022
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
AISTATS 2022
Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes
AISTATS 2022
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
AISTATS 2022
Offline Policy Selection under Uncertainty
AISTATS 2022
Off-Policy Risk Assessment for Markov Decision Processes
AISTATS 2022
Reinforcement Learning with Fast Stabilization in Linear Dynamical Systems
AISTATS 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
AISTATS 2022
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning
AISTATS 2022
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
AISTATS 2022
An Alternate Policy Gradient Estimator for Softmax Policies
AISTATS 2022
Adaptive Multi-Goal Exploration
AISTATS 2022
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
AISTATS 2022
Tile Networks: Learning Optimal Geometric Layout for Whole-page Recommendation
AISTATS 2022
The Curse of Passive Data Collection in Batch Reinforcement Learning
AISTATS 2022
Sample Complexity of Robust Reinforcement Learning with a Generative Model
AISTATS 2022
Finite Sample Analysis of Mean-Volatility Actor-Critic for Risk-Averse Reinforcement Learning
AISTATS 2022
Towards an Understanding of Default Policies in Multitask Policy Optimization
AISTATS 2022
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation
AISTATS 2022
Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions
AISTATS 2022
Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
COLING 2022
TopKG: Target-oriented Dialog via Global Planning on Knowledge Graph
COLING 2022
Different Data, Different Modalities! Reinforced Data Splitting for Effective Multimodal Information Extraction from Social Media Posts
COLING 2022