Papers

261 papers found
Direct Judgement Preference Optimization
PeiFeng Wang, Austin Xu, Yilun Zhou et al.
2025 EMNLP
2025 EMNLP
Weights-Rotated Preference Optimization for Large Language Models
Chenxu Yang, Ruipeng Jia, Mingyu Zheng et al.
2025 EMNLP
Image Difference Captioning via Adversarial Preference Optimization
Zihan Huang, Junda Wu, Rohan Surana et al.
2025 EMNLP
Learning to Translate Ambiguous Terminology by Preference Optimization on Post-Edits
Nathaniel Berger, Johannes Eschbach-Dymanus, Miriam Exel et al.
2025 EMNLP
SPO: Self Preference Optimization with Self Regularization
Yuhao Sun, Yifan Zhang, Quandong Wang et al.
2025 EMNLP
Creative Preference Optimization
Mete Ismayilzada, Antonio Laverghetta Jr., Simone A. Luchini et al.
2025 EMNLP
2025 EMNLP
CoTD-PO: Chain-of-Thought Distillation with Preference Optimization
Lujie Niu, Haochen Sun, Fangkun Zhao et al.
2025 EMNLP