reinforcement learning

4122 papers

Explore in graph

Also known as

RLVR HARL GRPO RL PPO REINFORCE RFT DRL RL NULL LQR RLHF

Co-occurring keywords

large language model (12755) policy learning (699) markov decision process (788) policy gradient (518) policy optimization (630) deep reinforcement learning (903) multi-agent system (1743) imitation learning (741) regret bound (1918) language model (4573)

Papers

Learning When and Where to Zoom With Deep Reinforcement Learning CVPR 2020

Optical Non-Line-of-Sight Physics-Based 3D Human Pose Estimation CVPR 2020

PADS: Policy-Adapted Sampling for Visual Similarity Learning CVPR 2020

Progressive Relation Learning for Group Activity Recognition CVPR 2020

NAS-FCOS: Fast Neural Architecture Search for Object Detection CVPR 2020

Better Captioning With Sequence-Level Exploration CVPR 2020

Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning? CVPR 2020

Scene Recomposition by Learning-Based ICP CVPR 2020

Active Vision for Early Recognition of Human Actions CVPR 2020

Optimal Auction Based Automated Negotiation in Realistic Decentralised Market Environments AAAI 2020

Named Entity Recognition Only from Word Embeddings EMNLP 2020

Keep CALM and Explore: Language Models for Action Generation in Text-based Games EMNLP 2020

Knowledge-guided Open Attribute Value Extraction with Reinforcement Learning EMNLP 2020

Learning Collaborative Agents with Rule Guidance for Knowledge Graph Reasoning EMNLP 2020

Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation EMNLP 2020

Supervised Seeded Iterated Learning for Interactive Language Learning EMNLP 2020

Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games EMNLP 2020

Q-learning with Language Model for Edit-based Unsupervised Summarization EMNLP 2020

Lookahead-Bounded Q-learning ICML 2020

Interactive Machine Comprehension with Information Seeking Agents ACL 2020

Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning ACL 2020

Stylized Text Generation: Approaches and Applications ACL 2020

Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension ACL 2020

Learning Efficient Dialogue Policy from Demonstrations through Shaping ACL 2020

Composing Elementary Discourse Units in Abstractive Summarization ACL 2020