conftrace_

reinforcement learning

4352 papers

Also known as

RL, REINFORCE

Co-occurring keywords

large language model (13587), policy learning (702), markov decision process (790), policy optimization (657), policy gradient (520), deep reinforcement learning (903), multi-agent system (1819), imitation learning (744), regret bound (1926), language model (4599)

Papers

Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation EACL 2021

Learning to Ask Conversational Questions by Optimizing Levenshtein Distance IJCNLP 2021

Transferable Dialogue Systems and User Simulators IJCNLP 2021

Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach IJCNLP 2021

Automated Concatenation of Embeddings for Structured Prediction IJCNLP 2021

Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards IJCNLP 2021

Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations IJCNLP 2021

LOA: Logical Optimal Actions for Text-based Interaction Games IJCNLP 2021

Turn-Level User Satisfaction Estimation in E-commerce Customer Service IJCNLP 2021

A Proposal: Interactively Learning to Summarise Timelines by Reinforcement Learning IJCNLP 2021

Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text IJCNLP 2021

Fully Gap-Dependent Bounds for Multinomial Logit Bandit AISTATS 2021

Adaptive Approximate Policy Iteration AISTATS 2021

Provable Hierarchical Imitation Learning via EM AISTATS 2021

Optimizing Percentile Criterion using Robust MDPs AISTATS 2021

Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders AISTATS 2021

On the Linear Convergence of Policy Gradient Methods for Finite MDPs AISTATS 2021

Reinforcement Learning for Mean Field Games with Strategic Complementarities AISTATS 2021

Reinforcement Learning for Constrained Markov Decision Processes AISTATS 2021

Explore the Context: Optimal Data Collection for Context-Conditional Dynamics Models AISTATS 2021

A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces AISTATS 2021

Logistic Q-Learning AISTATS 2021

A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms JMLR 2021

Auxiliary Tasks for Efficient Learning of Point-Goal Navigation WACV 2021

Adaptive Streaming of 360-Degree Videos With Reinforcement Learning WACV 2021