Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming
AAAI 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
AAAI 2024
Factored Online Planning in Many-Agent POMDPs
AAAI 2024
Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
AAAI 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
NIPS 2024
Controlling Character Motions Without Observable Driving Source
WACV 2024
Learning Multi-Object Positional Relationships via Emergent Communication
AAAI 2024
Response Enhanced Semi-supervised Dialogue Query Generation
AAAI 2024
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
NIPS 2024
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning
AAAI 2024
Perplexity-aware Correction for Robust Alignment with Noisy Preferences
NIPS 2024
A Critical Evaluation of AI Feedback for Aligning Large Language Models
NIPS 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
AAAI 2024
Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning
AAAI 2024
A PAC Learning Algorithm for LTL and Omega-Regular Objectives in MDPs
AAAI 2024
A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning
AAAI 2024
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons
EMNLP 2024
Risk-Conditioned Reinforcement Learning: A Generalized Approach for Adapting to Varying Risk Measures
AAAI 2024
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
AAAI 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
AAAI 2024
On learning history-based policies for controlling Markov decision processes
AISTATS 2024
EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading
AAAI 2024
mDPO: Conditional Preference Optimization for Multimodal Large Language Models
EMNLP 2024
Critic-Guided Decision Transformer for Offline Reinforcement Learning
AAAI 2024
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024
<
1
…
21
22
23
…
118
>