conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3,861 papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
EasyRL: A Simple and Extensible Reinforcement Learning Framework
AAAI 2021
A DQN-based Approach to Finding Precise Evidences for Fact Verification
ACL 2021
Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach
ACL 2021
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs
ACL 2021
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation
ACL 2021
Language Model Augmented Relevance Score
ACL 2021
How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation?
ACL 2021
Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards
ACL 2021
Turn-Level User Satisfaction Estimation in E-commerce Customer Service
ACL 2021
Meta-Reinforcement Learning for Mastering Multiple Skills and Generalizing across Environments in Text-based Games
ACL 2021
Interactive Reinforcement Learning for Table Balancing Robot
ACL 2021
RewardsOfSum: Exploring Reinforcement Learning Rewards for Summarisation
ACL 2021
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
ACL 2021
Meta-Model-Based Meta-Policy Optimization
ACML 2021
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning
ACML 2021
Cautious Actor-Critic
ACML 2021
Language Representations for Generalization in Reinforcement Learning
ACML 2021
Multi-task Actor-Critic with Knowledge Transfer via a Shared Critic
ACML 2021
Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning
ACML 2021
Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control
ACML 2021
Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation
ACML 2021
Learning 3-opt heuristics for traveling salesman problem via deep reinforcement learning
ACML 2021
Time-Constrained Multi-Agent Path Finding in Non-Lattice Graphs with Deep Reinforcement Learning
ACML 2021
Relation Also Need Attention: Integrating Relation Information Into Image Captioning
ACML 2021
Learning to Switch Optimizers for Quadratic Programming
ACML 2021
<
1
…
87
88
89
…
155
>