conftrace_

reinforcement learning

4352 papers

Explore in graph

Also known as

RL REINFORCE

Co-occurring keywords

large language model (13587) policy learning (702) markov decision process (790) policy optimization (657) policy gradient (520) deep reinforcement learning (903) multi-agent system (1819) imitation learning (744) regret bound (1926) language model (4599)

Papers

Generative Adversarial Regularized Mutual Information Policy Gradient Framework for Automatic Diagnosis AAAI 2020

Accelerating and Improving AlphaZero Using Population Based Training AAAI 2020

A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards ACL 2020

How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization NIPS 2020

The Mean-Squared Error of Double Q-Learning NIPS 2020

Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis NIPS 2020

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms NIPS 2020

SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving CORL 2020

Dueling Posterior Sampling for Preference-Based Reinforcement Learning UAI 2020

Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch UAI 2020

Zero-shot Text Classification via Reinforced Self-training ACL 2020

A Reinforced Generation of Adversarial Examples for Neural Machine Translation ACL 2020

Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports ACL 2020

Learning to Ask Medical Questions using Reinforcement Learning MLHC 2020

REST: Performance Improvement of a Black Box Model via RL-Based Spatial Transformation AAAI 2020

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey JMLR 2020

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning JMLR 2020

Balancing Quality and Human Involvement: An Effective Approach to Interactive Neural Machine Translation AAAI 2020

Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models AAAI 2020

Dialog State Tracking with Reinforced Data Augmentation AAAI 2020

Weak Supervision for Fake News Detection via Reinforcement Learning AAAI 2020

Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task CVPR 2020

BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps ACL 2020

CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation ACL 2020

Meta-Reinforced Multi-Domain State Generator for Dialogue Systems ACL 2020