Research Explorer
Reinforcement Learning
1263 directly classified papers
Papers per year
2006: 1
2007: 2
2008: 3
2009: 2
2010: 1
2011: 2
2012: 3
2013: 2
2014: 3
2015: 2
2016: 8
2017: 44
2018: 95
2019: 134
2020: 123
2021: 131
2022: 143
2023: 127
2024: 194
2025: 240
2026: 3
Papers
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons
ACL 2023
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
ACL 2023
Model-Based Simulation for Optimising Smart Reply
ACL 2023
Critic-Guided Decoding for Controlled Text Generation
ACL 2023
Enhancing Educational Dialogues: A Reinforcement Learning Approach for Generating AI Teacher Responses
ACL 2023
Reward Gaming in Conditional Text Generation
ACL 2023
Learning Optimal Policy for Simultaneous Machine Translation via Binary Search
ACL 2023
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
ACL 2023
Robust Average-Reward Markov Decision Processes
AAAI 2023
User-Oriented Robust Reinforcement Learning
AAAI 2023
Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-Based Guidance
AAAI 2023
Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning
ACL 2023
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
AAAI 2023
Reward-Based Negotiating Agent Strategies
AAAI 2023
DM²: Decentralized Multi-Agent Reinforcement Learning via Distribution Matching
AAAI 2023
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
AAAI 2023
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
AAAI 2023
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers
AAAI 2023
H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem
AAAI 2023
Off-Policy Proximal Policy Optimization
AAAI 2023
Simultaneously Updating All Persistence Values in Reinforcement Learning
AAAI 2023
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization for Heterogeneous Representational Coarseness
AAAI 2023
State-Conditioned Adversarial Subgoal Generation
AAAI 2023
DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning
AAAI 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
EMNLP 2023