conftrace_

reinforcement learning

4352 papers

Also known as

RL, REINFORCE

Co-occurring keywords

large language model (13587), policy learning (702), markov decision process (790), policy optimization (657), policy gradient (520), deep reinforcement learning (903), multi-agent system (1819), imitation learning (744), regret bound (1926), language model (4599)

Papers

Domain Adaptation for Conversational Query Production with the RAG Model Feedback EMNLP 2023

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression EMNLP 2023

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment EMNLP 2023

Simultaneous Machine Translation with Tailored Reference EMNLP 2023

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning INTERSPEECH 2023

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback EMNLP 2023

Intervention-Based Alignment of Code Search with Execution Feedback EMNLP 2023

Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation ICML 2023

Hybrid Systems Neural Control with Region-of-Attraction Planner L4DC 2023

Agile Catching with Whole-Body MPC and Blackbox Policy Learning L4DC 2023

Continuous Versatile Jumping Using Learned Action Residuals L4DC 2023

Regret Guarantees for Online Deep Control L4DC 2023

Hierarchical Policy Blending As Optimal Transport L4DC 2023

A Minimal Approach for Natural Language Action Space in Text-based Games CONLL 2023

Hierarchical State Abstraction based on Structural Information Principles IJCAI 2023

Soft Action Priors: Towards Robust Policy Transfer AAAI 2023

Learning to Play General-Sum Games against Multiple Boundedly Rational Agents AAAI 2023

Low Emission Building Control with Zero-Shot Reinforcement Learning AAAI 2023

Generalization through Diversity: Improving Unsupervised Environment Design IJCAI 2023

Active Observing in Continuous-time Control NIPS 2023

On the Importance of Exploration for Generalization in Reinforcement Learning NIPS 2023

Abstract then Play: A Skill-centric Reinforcement Learning Framework for Text-based Games ACL 2023

GeoDRL: A Self-Learning Framework for Geometry Problem Solving using Reinforcement Learning in Deductive Reasoning ACL 2023

Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System ACL 2023

Adaptive Ordered Information Extraction with Deep Reinforcement Learning ACL 2023