Reinforcement Learning

767 directly classified papers

Papers

Off-Policy Evaluation in Partially Observable Environments AAAI 2020

Planning with Abstract Learned Models While Learning Transferable Subtasks AAAI 2020

Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes AAAI 2020

Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization AAAI 2020

Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization AAAI 2020

Deep Conservative Policy Iteration AAAI 2020

Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning AAAI 2020

Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards AAAI 2020

Sparse Graphical Memory for Robust Planning NIPS 2020

Latent World Models For Intrinsically Motivated Exploration NIPS 2020

Online Decision Based Visual Tracking via Reinforcement Learning NIPS 2020

Learning the Linear Quadratic Regulator from Nonlinear Observations NIPS 2020

Recurrent Switching Dynamical Systems Models for Multiple Interacting Neural Populations NIPS 2020

Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition NIPS 2020

Effective Diversity in Population Based Reinforcement Learning NIPS 2020

BAR — A Reinforcement Learning Agent for Bounding-Box Automated Refinement AAAI 2020

Just Ask: An Interactive Learning Framework for Vision and Language Navigation AAAI 2020

Adaptive Quantitative Trading: An Imitative Deep Reinforcement Learning Approach AAAI 2020

Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction AAAI 2020

MetaLight: Value-Based Meta-Reinforcement Learning for Traffic Signal Control AAAI 2020

Finding Needles in a Moving Haystack: Prioritizing Alerts with Adversarial Reinforcement Learning AAAI 2020

Learning Behaviors with Uncertain Human Feedback UAI 2020

Learning Intrinsic Rewards as a Bi-Level Optimization Problem UAI 2020

Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation ACL 2020

Learning Efficient Dialogue Policy from Demonstrations through Shaping ACL 2020