Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Keywords
preference learning
411 papers
Explore in graph
Also known as
DPO
PL
Co-occurring keywords
large language model
(12755)
reinforcement learning
(4122)
direct preference optimization
(317)
reinforcement learning from human feedback
(261)
language model alignment
(142)
reward model
(251)
human feedback
(161)
reward modeling
(159)
model alignment
(219)
human preference
(120)
Papers
HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
NAACL 2024
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
ACL 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
NIPS 2024
Decoding-Time Language Model Alignment with Multiple Objectives
NIPS 2024
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback
ACL 2024
Aligner: Efficient Alignment by Learning to Correct
NIPS 2024
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization
EMNLP 2024
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment
EMNLP 2024
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
NIPS 2024
Learning to Paraphrase for Alignment with LLM Preference
EMNLP 2024
PEARL: Preference Extraction with Exemplar Augmentation and Retrieval with LLM Agents
EMNLP 2024
UltraMedical: Building Specialized Generalists in Biomedicine
NIPS 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NIPS 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
NIPS 2024
Deep Submodular Peripteral Networks
NIPS 2024
Improving Context-Aware Preference Modeling for Language Models
NIPS 2024
Improved Analysis for Bandit Learning in Matching Markets
NIPS 2024
Learning Preference Models with Sparse Interactions of Criteria
IJCAI 2023
Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes
ICML 2023
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning
ICML 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
NIPS 2023
Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning
CORL 2023
Preference learning for guiding the tree search in continuous POMDPs
CORL 2023
Learning Human Contribution Preferences in Collaborative Human-Robot Tasks
CORL 2023
Learning Choice Functions with Gaussian Processes
UAI 2023
<
1
…
9
10
11
…
17
>