Papers

261 papers found
Reverse Preference Optimization for Complex Instruction Following
Xiang Huang, Ting-En Lin, Feiteng Fang et al.
2025 ACL
2025 ACL
Using LLMs and Preference Optimization for Agreement-Aware HateWiC Classification
Sebastian Loftus, Adrian Mülthaler, Sanne Hoeken et al.
2025 ACL
Edit-Wise Preference Optimization for Grammatical Error Correction
Jiehao Liang, Haihui Yang, Shiping Gao et al.
2025 COLING
2025 COLING
Diffusion Model Alignment Using Direct Preference Optimization
Bram Wallace, Meihua Dang, Rafael Rafailov et al.
2024 CVPR