Reinforcement Learning

1263 directly classified papers

Papers

Palm up: Playing in the Latent Manifold for Unsupervised Pretraining NIPS 2022

Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning NIPS 2022

Learning Representations via a Robust Behavioral Metric for Deep Reinforcement Learning NIPS 2022

Heterogeneous Skill Learning for Multi-agent Tasks NIPS 2022

DMAP: a Distributed Morphological Attention Policy for learning to locomote with a changing body NIPS 2022

Deep Surrogate Assisted Generation of Environments NIPS 2022

You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments NIPS 2022

Learning and Analyzing Generation Order for Undirected Sequence Models EMNLP 2021

Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration EMNLP 2021

Language Resource Efficient Learning for Captioning EMNLP 2021

Language Models are Few-Shot Butlers EMNLP 2021

A Generative Framework for Simultaneous Machine Translation EMNLP 2021

Modeling Document-Level Context for Event Detection via Important Context Selection EMNLP 2021

Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy EMNLP 2021

Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation EMNLP 2021

Bayesian Distributional Policy Gradients AAAI 2021

PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector AAAI 2021

Distributional Reinforcement Learning via Moment Matching AAAI 2021

Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach AAAI 2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation AAAI 2021

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction AAAI 2021

Self-Supervised Attention-Aware Reinforcement Learning AAAI 2021

Self-correcting Q-learning AAAI 2021

Value-Decomposition Multi-Agent Actor-Critics AAAI 2021

Coordination Between Individual Agents in Multi-Agent Reinforcement Learning AAAI 2021