Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Reinforcement Learning
1263 directly classified papers
Papers per year
2006: 1
2007: 2
2008: 3
2009: 2
2010: 1
2011: 2
2012: 3
2013: 2
2014: 3
2015: 2
2016: 8
2017: 44
2018: 95
2019: 134
2020: 123
2021: 131
2022: 143
2023: 127
2024: 194
2025: 240
2026: 3
Papers
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
L4DC 2022
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator
AAAI 2022
Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach
ACL 2022
Exploring Safer Behaviors for Deep Reinforcement Learning
AAAI 2022
Experience Replay with Likelihood-free Importance Weights
L4DC 2022
Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline
NIPS 2022
ALPHAPROG: Reinforcement Generation of Valid Programs for Compiler Fuzzing
AAAI 2022
Using Graph-Aware Reinforcement Learning to Identify Winning Strategies in Diplomacy Games (Student Abstract)
AAAI 2022
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
AAAI 2022
PantheonRL: A MARL Library for Dynamic Training Interactions
AAAI 2022
A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
AAAI 2022
Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs
AAAI 2022
MDPGT: Momentum-Based Decentralized Policy Gradient Tracking
AAAI 2022
Eye of the Beholder: Improved Relation Generalization for Text-Based Reinforcement Learning Agents
AAAI 2022
Episodic Policy Gradient Training
AAAI 2022
Dimensionality Reduction and Prioritized Exploration for Policy Search
AISTATS 2022
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
AAAI 2022
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
AAAI 2022
ValueNet: A New Dataset for Human Value Driven Dialogue System
AAAI 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
EMNLP 2022
State Deviation Correction for Offline Reinforcement Learning
AAAI 2022
Efficient (Soft) Q-Learning for Text Generation with Limited Good Data
EMNLP 2022
Revisiting the Roles of “Text” in Text Games
EMNLP 2022
Robust Action Gap Increasing with Clipped Advantage Learning
AAAI 2022
<
1
…
23
24
25
…
51
>