reinforcement learning

4122 papers

Also known as

RLVR HARL GRPO RL PPO REINFORCE RFT DRL LQR RLHF

Co-occurring keywords

large language model (12755), policy learning (699), markov decision process (788), policy gradient (518), policy optimization (630), deep reinforcement learning (903), multi-agent system (1743), imitation learning (741), regret bound (1918), language model (4573)

Papers

Interactive Fiction Games: A Colossal Adventure AAAI 2020

Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning AAAI 2020

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning AAAI 2020

A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound AAAI 2020

Reinforcement Learning When All Actions Are Not Always Available AAAI 2020

Lifelong Learning with a Changing Action Set AAAI 2020

BAR — A Reinforcement Learning Agent for Bounding-Box Automated Refinement AAAI 2020

RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning AAAI 2020

Neural Approximate Dynamic Programming for On-Demand Ride-Pooling AAAI 2020

Discretizing Continuous Action Space for On-Policy Optimization AAAI 2020

Learning to Optimize Variational Quantum Circuits to Solve Combinatorial Problems AAAI 2020

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory NIPS 2020

Sparse Graphical Memory for Robust Planning NIPS 2020

Effective Diversity in Population Based Reinforcement Learning NIPS 2020

Weakly-Supervised Reinforcement Learning for Controllable Behavior NIPS 2020

Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation NIPS 2020

Learning Affordance Landscapes for Interaction Exploration in 3D Environments NIPS 2020

Discovering Reinforcement Learning Algorithms NIPS 2020

Latent World Models For Intrinsically Motivated Exploration NIPS 2020

Evaluating and Rewarding Teamwork Using Cooperative Game Abstractions NIPS 2020

Learning Dynamic Belief Graphs to Generalize on Text-Based Games NIPS 2020

SAPIEN: A SimulAted Part-Based Interactive ENvironment CVPR 2020

Learning Situational Driving CVPR 2020

Exploring Data Aggregation in Policy Learning for Vision-Based Urban Autonomous Driving CVPR 2020

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation CVPR 2020