Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Reinforcement Learning
767 directly classified papers
Papers per year
2006: 1
2007: 6
2008: 3
2009: 2
2010: 4
2011: 3
2012: 8
2013: 3
2014: 4
2016: 4
2017: 21
2018: 48
2019: 75
2020: 73
2021: 86
2022: 107
2023: 116
2024: 127
2025: 76
Papers
Off-Policy Evaluation in Partially Observable Environments
AAAI 2020
Planning with Abstract Learned Models While Learning Transferable Subtasks
AAAI 2020
Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes
AAAI 2020
Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization
AAAI 2020
Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization
AAAI 2020
Deep Conservative Policy Iteration
AAAI 2020
Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning
AAAI 2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
AAAI 2020
Sparse Graphical Memory for Robust Planning
NIPS 2020
Latent World Models For Intrinsically Motivated Exploration
NIPS 2020
Online Decision Based Visual Tracking via Reinforcement Learning
NIPS 2020
Learning the Linear Quadratic Regulator from Nonlinear Observations
NIPS 2020
Recurrent Switching Dynamical Systems Models for Multiple Interacting Neural Populations
NIPS 2020
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition
NIPS 2020
Effective Diversity in Population Based Reinforcement Learning
NIPS 2020
BAR — A Reinforcement Learning Agent for Bounding-Box Automated Refinement
AAAI 2020
Just Ask: An Interactive Learning Framework for Vision and Language Navigation
AAAI 2020
Adaptive Quantitative Trading: An Imitative Deep Reinforcement Learning Approach
AAAI 2020
Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction
AAAI 2020
MetaLight: Value-Based Meta-Reinforcement Learning for Traffic Signal Control
AAAI 2020
Finding Needles in a Moving Haystack: Prioritizing Alerts with Adversarial Reinforcement Learning
AAAI 2020
Learning Behaviors with Uncertain Human Feedback
UAI 2020
Learning Intrinsic Rewards as a Bi-Level Optimization Problem
UAI 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
ACL 2020
Learning Efficient Dialogue Policy from Demonstrations through Shaping
ACL 2020
<
1
…
22
23
24
…
31
>