Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification
CVPR 2024
Learning Sampling Policy to Achieve Fewer Queries for Zeroth-Order Optimization
AISTATS 2024
Online Reinforcement Learning-Based Pedagogical Planning for Narrative-Centered Learning Environments
AAAI 2024
Towards Achieving Sub-linear Regret and Hard Constraint Violation in Model-free RL
AISTATS 2024
EgoGen: An Egocentric Synthetic Data Generator
CVPR 2024
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving
CVPR 2024
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
AISTATS 2024
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
CVPR 2024
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
CVPR 2024
On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation
AISTATS 2024
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning
CVPR 2024
TFWT: Tabular Feature Weighting with Transformer
IJCAI 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
ACL 2024
RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation
ACL 2024
Rich Human Feedback for Text-to-Image Generation
CVPR 2024
Improving Autonomous Separation Assurance through Distributed Reinforcement Learning with Attention Networks
AAAI 2024
Mimicking the Maestro: Exploring the Efficacy of a Virtual AI Teacher in Fine Motor Skill Acquisition
AAAI 2024
Diversification of Adaptive Policy for Effective Offline Reinforcement Learning
IJCAI 2024
SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward
IJCAI 2024
Actor Prioritized Experience Replay (Abstract Reprint)
AAAI 2024
Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes
AAAI 2024
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
CVPR 2024
Reward Certification for Policy Smoothed Reinforcement Learning
AAAI 2024
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint)
AAAI 2024