Artificial Intelligence › Core AI ›

Reinforcement Learning

767 directly classified papers

Papers per year

Papers

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds NIPS 2022

Improving Policy Learning via Language Dynamics Distillation NIPS 2022

Near-Optimal Multi-Agent Learning for Safe Coverage Control NIPS 2022

Constrained Update Projection Approach to Safe Policy Optimization NIPS 2022

Understanding the Evolution of Linear Regions in Deep Reinforcement Learning NIPS 2022

Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning NIPS 2022

Direct Advantage Estimation NIPS 2022

Human-AI Shared Control via Policy Dissection NIPS 2022

EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL NIPS 2022

Action-modulated midbrain dopamine activity arises from distributed control policies NIPS 2022

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems EMNLP 2022

Self-Organized Group for Cooperative Multi-agent Reinforcement Learning NIPS 2022

Offline-to-Online Co-Evolutional User Simulator and Dialogue System EMNLP 2022

Towards a Standardised Performance Evaluation Protocol for Cooperative MARL NIPS 2022

Grounded Reinforcement Learning: Learning to Win the Game under Human Commands NIPS 2022

Meta-Reinforcement Learning with Self-Modifying Networks NIPS 2022

Revisiting the Roles of “Text” in Text Games EMNLP 2022

How to Reduce Action Space for Planning Domains? (Student Abstract) AAAI 2022

Reinforcement Learning Explainability via Model Transforms (Student Abstract) AAAI 2022

Reinforcement Learning for Datacenter Congestion Control AAAI 2022

iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control AAAI 2022

You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments NIPS 2022

Efficient Dialog Policy Learning by Reasoning with Contextual Knowledge AAAI 2022

Unsupervised Skill Discovery via Recurrent Skill Training NIPS 2022

A Direct Approximation of AIXI Using Logical State Abstractions NIPS 2022