preference learning

411 papers

Explore in graph

Also known as

DPO PL

Co-occurring keywords

large language model (12755) reinforcement learning (4122) direct preference optimization (317) reinforcement learning from human feedback (261) language model alignment (142) reward model (251) human feedback (161) reward modeling (159) model alignment (219) human preference (120)

Papers

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM NAACL 2024

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs ACL 2024

Online Iterative Reinforcement Learning from Human Feedback with General Preference Model NIPS 2024

Decoding-Time Language Model Alignment with Multiple Objectives NIPS 2024

Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback ACL 2024

Aligner: Efficient Alignment by Learning to Correct NIPS 2024

V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization EMNLP 2024

How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment EMNLP 2024

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning NIPS 2024

Learning to Paraphrase for Alignment with LLM Preference EMNLP 2024

PEARL: Preference Extraction with Exemplar Augmentation and Retrieval with LLM Agents EMNLP 2024

UltraMedical: Building Specialized Generalists in Biomedicine NIPS 2024

REBEL: Reinforcement Learning via Regressing Relative Rewards NIPS 2024

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning NIPS 2024

Deep Submodular Peripteral Networks NIPS 2024

Improving Context-Aware Preference Modeling for Language Models NIPS 2024

Improved Analysis for Bandit Learning in Matching Markets NIPS 2024

Learning Preference Models with Sparse Interactions of Criteria IJCAI 2023

Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes ICML 2023

Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning ICML 2023

Preference-grounded Token-level Guidance for Language Model Fine-tuning NIPS 2023

Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning CORL 2023

Preference learning for guiding the tree search in continuous POMDPs CORL 2023

Learning Human Contribution Preferences in Collaborative Human-Robot Tasks CORL 2023

Learning Choice Functions with Gaussian Processes UAI 2023