Papers
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
Junshu Pan, Wei Shen, Shulin Huang et al.
LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models
Tiesunlong Shen, Rui Mao, Jin Wang et al.
Multi-level Style Preference Optimization: An Adaptive Detection Framework for Human-Machine Hybrid Text
Zehao Wang, Lianwei Wu, Wenbo An et al.
AP2O-Coder: Adaptively Progressive Preference Optimization for Reducing Compilation and Runtime Errors in LLM-Generated Code
Jianqing Zhang, Wei Xia, Hande Dong et al.
MetaGDPO: Alleviating Catastrophic Forgetting with Metacognitive Knowledge Through Group Direct Preference Optimization
Lanxue Zhang, Yuqiang Xie, Fang Fang et al.
Preference Optimization via Contrastive Divergence: Your Policy Is Secretly an NLL Estimator
Zhuotong Chen, Fang Liu, Xuan Zhu et al.
AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment
Ruibo Deng, Duanyu Feng, Wenqiang Lei
DETONATE – A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization
Renjith Prasad Kaippilly Mana, Abhilekh Borah, Hasnat Md Abdullah et al.
NHK Submission to WAT 2025: Leveraging Preference Optimization for Article-level Japanese–English News Translation
Hideya Mino, Rei Endo, Yoshihiko Kawai
High-Dimensional Dueling Optimization with Preference Embedding
Yangwenhui Zhang, Hong Qian, Xiang Shu et al.
Preference Ranking Optimization for Human Alignment
Feifan Song, Bowen Yu, Minghao Li et al.
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Junru Lu, Siyu An, Min Zhang et al.
Relation-Augmented Dueling Bayesian Optimization via Preference Propagation
Xiang Xia, Xiang Shu, Shuo Liu et al.
Gradient-Based Optimization for Bayesian Preference Elicitation
Ivan Vendrov, Tyler Lu, Qingqing Huang et al.
Multi-Objective Bayesian Optimization with Active Preference Learning
Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki et al.
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment
Gaofeng Liu, Zhiyuan Ma, Tao Fang
Multi-attribute Bayesian optimization with interactive preference learning
Raul Astudillo, Peter Frazier
DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment
Rongzhi Zhang, Chenwei Zhang, Xinyang Zhang et al.
Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning
Xinyu Sun, Zhikun Zhao, Congyan Lang et al.
Targeted Hyperparameter Optimization with Lexicographic Preferences Over Multiple Objectives
Shaokun Zhang, Feiran Jia, Chi Wang et al.
Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval
GuangHao Meng, Jinpeng Wang, Jieming Zhu et al.
Token-level Preference Self-Alignment Optimization for Multi-style Outline Controllable Generation
Zihao Li, Xuekong Xu, Ziyao Chen et al.
MWPO: Enhancing LLMs Performance through Multi-Weight Preference Strength and Length Optimization
Shiyue Xu, Fu Zhang, Jingwei Cheng et al.