Reinforcement Learning

2932 directly classified papers

Papers

Addressing Exposure Bias With Document Minimum Risk Training: Cambridge at the WMT20 Biomedical Translation Task EMNLP 2020

A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound AAAI 2020

Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control AAAI 2020

Reinforcement Learning When All Actions Are Not Always Available AAAI 2020

Lifelong Learning with a Changing Action Set AAAI 2020

Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning AAAI 2020

A Reinforcement Learning Approach to Strategic Belief Revelation with Social Influence AAAI 2020

Off-Policy Evaluation in Partially Observable Environments AAAI 2020

Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments AAAI 2020

Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models AAAI 2020

Sequence Generation with Optimal-Transport-Enhanced Reinforcement Learning AAAI 2020

Effective Diversity in Population Based Reinforcement Learning NIPS 2020

Collapsing Bandits and Their Application to Public Health Intervention NIPS 2020

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms NIPS 2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs NIPS 2020

Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward ACL 2020

Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis NIPS 2020

Online Planning with Lookahead Policies NIPS 2020

Memory-Efficient Learning of Stable Linear Dynamical Systems for Prediction and Control NIPS 2020

Predictive Information Accelerates Learning in RL NIPS 2020

The Mean-Squared Error of Double Q-Learning NIPS 2020

Off-Policy Evaluation via the Regularized Lagrangian NIPS 2020

Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning? CVPR 2020

Fast Template Matching and Update for Video Object Tracking and Segmentation CVPR 2020

Achieving Fairness in the Stochastic Multi-Armed Bandit Problem AAAI 2020