reinforcement learning

4122 papers

Also known as

RLVR, HARL, GRPO, RL, PPO, REINFORCE, RFT, DRL, LQR, RLHF

Co-occurring keywords

large language model (12755), policy learning (699), markov decision process (788), policy gradient (518), policy optimization (630), deep reinforcement learning (903), multi-agent system (1743), imitation learning (741), regret bound (1918), language model (4573)

Papers

Learning Behaviors with Uncertain Human Feedback UAI 2020

A Natural Lottery Ticket Winner: Reinforcement Learning with Ordinary Neural Circuits ICML 2020

Hierarchically Decoupled Imitation For Morphological Transfer ICML 2020

From Importance Sampling to Doubly Robust Policy Gradient ICML 2020

One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control ICML 2020

Generalization to New Actions in Reinforcement Learning ICML 2020

Reward-Free Exploration for Reinforcement Learning ICML 2020

Evaluating the Performance of Reinforcement Learning Algorithms ICML 2020

Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation ICML 2020

Statistically Efficient Off-Policy Policy Gradients ICML 2020

Active World Model Learning with Progress Curiosity ICML 2020

CURL: Contrastive Unsupervised Representations for Reinforcement Learning ICML 2020

Skew-Fit: State-Covering Self-Supervised Reinforcement Learning ICML 2020

Off-Policy Actor-Critic with Shared Experience Replay ICML 2020

Planning to Explore via Self-Supervised World Models ICML 2020

Optimistic Policy Optimization with Bandit Feedback ICML 2020

Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making ICML 2020

Multi-Agent Determinantal Q-Learning ICML 2020

Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate ICML 2020

Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning IJCAI 2020

Self-Guided Evolution Strategies with Historical Estimated Gradients IJCAI 2020

Exploration Based Language Learning for Text-Based Games IJCAI 2020

IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL IJCAI 2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge IJCAI 2020

Generalized Mean Estimation in Monte-Carlo Tree Search IJCAI 2020