Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Reinforcement Learning
1263 directly classified papers
Papers per year
2006: 1
2007: 2
2008: 3
2009: 2
2010: 1
2011: 2
2012: 3
2013: 2
2014: 3
2015: 2
2016: 8
2017: 44
2018: 95
2019: 134
2020: 123
2021: 131
2022: 143
2023: 127
2024: 194
2025: 240
2026: 3
Papers
Trading off Utility, Informativeness, and Complexity in Emergent Communication
NIPS 2022
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
NIPS 2022
Behavior Transformers: Cloning $k$ modes with one stone
NIPS 2022
Dungeons and Data: A Large-Scale NetHack Dataset
NIPS 2022
Mask-based Latent Reconstruction for Reinforcement Learning
NIPS 2022
DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems
NIPS 2022
HyperTree Proof Search for Neural Theorem Proving
NIPS 2022
The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design
NIPS 2022
Learning General World Models in a Handful of Reward-Free Deployments
NIPS 2022
QUARK: Controllable Text Generation with Reinforced Unlearning
NIPS 2022
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control
NIPS 2022
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs
NIPS 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
NIPS 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
NIPS 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
NIPS 2022
Distributional Reinforcement Learning for Risk-Sensitive Policies
NIPS 2022
Supported Policy Optimization for Offline Reinforcement Learning
NIPS 2022
TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification
NIPS 2022
Robust Reinforcement Learning using Offline Data
NIPS 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
NIPS 2022
The least-control principle for local learning at equilibrium
NIPS 2022
Uni[MASK]: Unified Inference in Sequential Decision Problems
NIPS 2022
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
NIPS 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
NIPS 2022
The Pitfalls of Regularization in Off-Policy TD Learning
NIPS 2022
<
1
…
27
28
29
…
51
>