conftrace
_
Papers
Trends
Conferences
Explore
More
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Keywords
reinforcement learning
4352 papers
Explore in graph
Also known as
RL
REINFORCE
Co-occurring keywords
large language model
(13587)
policy learning
(702)
markov decision process
(790)
policy optimization
(657)
policy gradient
(520)
deep reinforcement learning
(903)
multi-agent system
(1819)
imitation learning
(744)
regret bound
(1926)
language model
(4599)
Papers
Domain Adaptation for Conversational Query Production with the RAG Model Feedback
EMNLP 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
EMNLP 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
EMNLP 2023
Simultaneous Machine Translation with Tailored Reference
EMNLP 2023
Boosting Punctuation Restoration with Data Generation and Reinforcement Learning
INTERSPEECH 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
EMNLP 2023
Intervention-Based Alignment of Code Search with Execution Feedback
EMNLP 2023
Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation
ICML 2023
Hybrid Systems Neural Control with Region-of-Attraction Planner
L4DC 2023
Agile Catching with Whole-Body MPC and Blackbox Policy Learning
L4DC 2023
Continuous Versatile Jumping Using Learned Action Residuals
L4DC 2023
Regret Guarantees for Online Deep Control
L4DC 2023
Hierarchical Policy Blending As Optimal Transport
L4DC 2023
A Minimal Approach for Natural Language Action Space in Text-based Games
CONLL 2023
Hierarchical State Abstraction based on Structural Information Principles
IJCAI 2023
Soft Action Priors: Towards Robust Policy Transfer
AAAI 2023
Learning to Play General-Sum Games against Multiple Boundedly Rational Agents
AAAI 2023
Low Emission Building Control with Zero-Shot Reinforcement Learning
AAAI 2023
Generalization through Diversity: Improving Unsupervised Environment Design
IJCAI 2023
Active Observing in Continuous-time Control
NIPS 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
NIPS 2023
Abstract then Play: A Skill-centric Reinforcement Learning Framework for Text-based Games
ACL 2023
GeoDRL: A Self-Learning Framework for Geometry Problem Solving using Reinforcement Learning in Deductive Reasoning
ACL 2023
Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System
ACL 2023
Adaptive Ordered Information Extraction with Deep Reinforcement Learning
ACL 2023
<
1
…
79
80
81
…
175
>