Papers

261 papers found
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park, Rafael Rafailov, Stefano Ermon et al.
2024 ACL
Direct Preference Optimization with an Offset
Afra Amini, Tim Vieira, Ryan Cotterell
2024 ACL
2025 ACL
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs
Nicholas E. Corrado, Julian Katz-Samuels, Adithya M Devraj et al.
2025 ACL
LPOI: Listwise Preference Optimization for Vision Language Models
Fatemeh Pesaran Zadeh, Yoojin Oh, Gunhee Kim
2025 ACL
T-REG: Preference Optimization with Token-Level Reward Regularization
Wenxuan Zhou, Shujian Zhang, Lingxiao Zhao et al.
2025 ACL
K-order Ranking Preference Optimization for Large Language Models
Shihao Cai, Chongming Gao, Yang Zhang et al.
2025 ACL
Robust Preference Optimization via Dynamic Target Margins
Jie Sun, Junkang Wu, Jiancan Wu et al.
2025 ACL
2025 ACL