Research Explorer
Reinforcement Learning › Methods › Policy Learning
2068 directly classified papers
Papers per year
2002: 6
2003: 1
2004: 1
2006: 11
2007: 10
2008: 14
2009: 9
2010: 23
2011: 15
2012: 25
2013: 25
2014: 24
2015: 23
2016: 27
2017: 61
2018: 107
2019: 187
2020: 216
2021: 274
2022: 259
2023: 321
2024: 247
2025: 153
2026: 29
Papers
Extracting Action Sequences from Texts Based on Deep Reinforcement Learning
IJCAI 2018
Toward Automatically Measuring Learner Ability from Human-Machine Dialog Interactions using Novel Psychometric Models
NAACL 2018
Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
JMLR 2018
Fighting Boredom in Recommender Systems with Linear Reinforcement Learning
NIPS 2018
Goal-Oriented Chatbot Dialog Management Bootstrapping with Transfer Learning
IJCAI 2018
Jumper: Learning When to Make Classification Decision in Reading
IJCAI 2018
A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning
IJCAI 2018
Reinforcing Coherence for Sequence to Sequence Model in Dialogue Generation
IJCAI 2018
Multi-modal Predicate Identification using Dynamically Learned Robot Controllers
IJCAI 2018
Planning and Learning with Stochastic Action Sets
IJCAI 2018
Expectation Optimization with Probabilistic Guarantees in POMDPs with Discounted-Sum Objectives
IJCAI 2018
Emergency Response Optimization using Online Hybrid Planning
IJCAI 2018
Goal-HSVI: Heuristic Search Value Iteration for Goal POMDPs
IJCAI 2018
Learning to Infer Final Plans in Human Team Planning
IJCAI 2018
PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making
IJCAI 2018
Risk-Aware Active Inverse Reinforcement Learning
CoRL 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
ICML 2018
Path Consistency Learning in Tsallis Entropy Regularized MDPs
ICML 2018
Mix & Match Agent Curricula for Reinforcement Learning
ICML 2018
Beyond the One-Step Greedy Approach in Reinforcement Learning
ICML 2018
Scheduled Policy Optimization for Natural Language Communication with Intelligent Agents
IJCAI 2018
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach
NIPS 2018
Lifelong Inverse Reinforcement Learning
NIPS 2018
Fourier Policy Gradients
ICML 2018
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
ICML 2018