preference learning

411 papers

Explore in graph

Also known as

DPO PL

Co-occurring keywords

large language model (12755) reinforcement learning (4122) direct preference optimization (317) reinforcement learning from human feedback (261) language model alignment (142) reward model (251) human feedback (161) reward modeling (159) model alignment (219) human preference (120)

Papers

Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM ACL 2024

Automated Multi-level Preference for MLLMs NIPS 2024

Neural Reasoning about Agents’ Goals, Preferences, and Actions AAAI 2024

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation ACL 2024

Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal IJCAI 2024

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies AISTATS 2024

Teaching Language Models to Self-Improve by Learning from Language Feedback ACL 2024

Preference-based Pure Exploration NIPS 2024

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs ACL 2024

Looping in the Human: Collaborative and Explainable Bayesian Optimization AISTATS 2024

Let Me Teach You: Pedagogical Foundations of Feedback for Language Models EMNLP 2024

Decoding-Time Language Model Alignment with Multiple Objectives NIPS 2024

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM NAACL 2024

Differentially Private Reward Estimation with Preference Feedback AISTATS 2024

Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM ACL 2024

Enhancing Multimodal Emotion Recognition through ASR Error Compensation and LLM Fine-Tuning INTERSPEECH 2024

Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources AISTATS 2024

Contrastive Preference Learning for Neural Machine Translation NAACL 2024

A Preference-driven Paradigm for Enhanced Translation with Large Language Models NAACL 2024

Learning Populations of Preferences via Pairwise Comparison Queries AISTATS 2024

Improving Attributed Text Generation of Large Language Models via Preference Learning ACL 2024

The Paradox of Preference: A Study on LLM Alignment Algorithms and Data Acquisition Methods NAACL 2024

A General Theoretical Paradigm to Understand Learning from Human Preferences AISTATS 2024

CURATRON: Complete and Robust Preference Data for Rigorous Alignment of Large Language Models NAACL 2024

Learning Conditional Preference Networks: An Approach Based on the Minimum Description Length Principle IJCAI 2024