conftrace_

reinforcement learning

4352 papers

Explore in graph

Also known as

RL REINFORCE

Co-occurring keywords

large language model (13587) policy learning (702) markov decision process (790) policy optimization (657) policy gradient (520) deep reinforcement learning (903) multi-agent system (1819) imitation learning (744) regret bound (1926) language model (4599)

Papers

Description Based Text Classification with Reinforcement Learning ICML 2020

Discount Factor as a Regularizer in Reinforcement Learning ICML 2020

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms ICML 2020

Designing Optimal Dynamic Treatment Regimes: A Causal Reinforcement Learning Approach ICML 2020

Unknown mixing times in apprenticeship and reinforcement learning UAI 2020

Learning Behaviors with Uncertain Human Feedback UAI 2020

A Natural Lottery Ticket Winner: Reinforcement Learning with Ordinary Neural Circuits ICML 2020

Hierarchically Decoupled Imitation For Morphological Transfer ICML 2020

From Importance Sampling to Doubly Robust Policy Gradient ICML 2020

One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control ICML 2020

Generalization to New Actions in Reinforcement Learning ICML 2020

Reward-Free Exploration for Reinforcement Learning ICML 2020

Evaluating the Performance of Reinforcement Learning Algorithms ICML 2020

Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation ICML 2020

Statistically Efficient Off-Policy Policy Gradients ICML 2020

Active World Model Learning with Progress Curiosity ICML 2020

CURL: Contrastive Unsupervised Representations for Reinforcement Learning ICML 2020

Skew-Fit: State-Covering Self-Supervised Reinforcement Learning ICML 2020

Off-Policy Actor-Critic with Shared Experience Replay ICML 2020

Planning to Explore via Self-Supervised World Models ICML 2020

Optimistic Policy Optimization with Bandit Feedback ICML 2020

Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making ICML 2020

Multi-Agent Determinantal Q-Learning ICML 2020

Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate ICML 2020

Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning IJCAI 2020