Research Explorer
Reinforcement Learning
767 directly classified papers
Papers per year
2006: 1
2007: 6
2008: 3
2009: 2
2010: 4
2011: 3
2012: 8
2013: 3
2014: 4
2016: 4
2017: 21
2018: 48
2019: 75
2020: 73
2021: 86
2022: 107
2023: 116
2024: 127
2025: 76
Papers
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
NIPS 2022
Improving Policy Learning via Language Dynamics Distillation
NIPS 2022
Near-Optimal Multi-Agent Learning for Safe Coverage Control
NIPS 2022
Constrained Update Projection Approach to Safe Policy Optimization
NIPS 2022
Understanding the Evolution of Linear Regions in Deep Reinforcement Learning
NIPS 2022
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning
NIPS 2022
Direct Advantage Estimation
NIPS 2022
Human-AI Shared Control via Policy Dissection
NIPS 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
NIPS 2022
Action-modulated midbrain dopamine activity arises from distributed control policies
NIPS 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
EMNLP 2022
Self-Organized Group for Cooperative Multi-agent Reinforcement Learning
NIPS 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
NIPS 2022
Grounded Reinforcement Learning: Learning to Win the Game under Human Commands
NIPS 2022
Meta-Reinforcement Learning with Self-Modifying Networks
NIPS 2022
Revisiting the Roles of “Text” in Text Games
EMNLP 2022
How to Reduce Action Space for Planning Domains? (Student Abstract)
AAAI 2022
Reinforcement Learning Explainability via Model Transforms (Student Abstract)
AAAI 2022
Reinforcement Learning for Datacenter Congestion Control
AAAI 2022
iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control
AAAI 2022
You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
NIPS 2022
Efficient Dialog Policy Learning by Reasoning with Contextual Knowledge
AAAI 2022
Unsupervised Skill Discovery via Recurrent Skill Training
NIPS 2022
A Direct Approximation of AIXI Using Logical State Abstractions
NIPS 2022