conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2,932 papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
ReGen: Reinforcement Learning for Text and Knowledge Base Generation using Pretrained Language Models
EMNLP 2021
Generalized Linear Bandits with Local Differential Privacy
NIPS 2021
On the Linear Convergence of Policy Gradient Methods for Finite MDPs
AISTATS 2021
Metrics and Continuity in Reinforcement Learning
AAAI 2021
Two steps to risk sensitivity
NIPS 2021
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
NIPS 2021
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation
NIPS 2021
Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms
AISTATS 2021
Bandits with partially observable confounded data
UAI 2021
Neuro-Symbolic Approaches for Text-Based Policy Learning
EMNLP 2021
Conservative Offline Distributional Reinforcement Learning
NIPS 2021
Variance Penalized On-Policy and Off-Policy Actor-Critic
AAAI 2021
Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System
L4DC 2021
A DQN-based Approach to Finding Precise Evidences for Fact Verification
ACL 2021
Solving JumpIN’ Using Zero-Dependency Reinforcement Learning (Student Abstract)
AAAI 2021
Multi-Objective Reinforcement Learning for Designing Ethical Environments
IJCAI 2021
Encoding Human Domain Knowledge to Warm Start Reinforcement Learning
AAAI 2021
Robust Reinforcement Learning: A Constrained Game-theoretic Approach
L4DC 2021
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
AAAI 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation under Adaptivity Constraints
NIPS 2021
Neural Algorithmic Reasoners are Implicit Planners
NIPS 2021
Context-Aware Scene Graph Generation With Seq2Seq Transformers
ICCV 2021
Settling the Variance of Multi-Agent Policy Gradients
NIPS 2021
Learning Routines for Effective Off-Policy Reinforcement Learning
ICML 2021
Detecting Rewards Deterioration in Episodic Reinforcement Learning
ICML 2021
<
1
…
67
68
69
…
118
>